Recombinant polypeptides and peptides, nucleic acids coding for the same and use of these 
polypeptides and peptides in the diagnostic of tuberculosis. 

The invention relates to recombinant polypeptides and peptides and particularly to the polypeptide containing 
in its polypeptidic chain the following amino acid sequence: 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted by 
nucleotide at position (194) represented in fig. 4a and fig. 4b. 

The polypeptides and peptides of the invention can be used for the diagnostic of tuberculosis, and can also 
be part of the active principle in the preparation of vaccine against tuberculosis. 
RECOMBINANT POLYPEPTIDES AND PEPTIDES, NUCLEIC ACIDS CODING FOR THE SAME AND USE OF 
THESE POLYPEPTIDES AND PEPTIDES IN THE DIAGNOSTIC OF TUBERCULOSIS 

The invention relates to recombinant polypeptides and peptides, which can be used for the diagnosis of 
tuberculosis. The invention also relates to a process for preparing the above-said polypeptides and 
peptides, which are in a state of biological purity such that they can be used as part of the active principle 
in the preparation of vaccines against tuberculosis. 
It also relates to nucleic acids coding for said polypeptides and peptides. 

Furthermore, the invention relates to the in vitro diagnostic methods and kits using the above-said 
polypeptides and peptides and to the vaccines containing the above-said polypeptides and peptides as 
active principle against tuberculosis. 

By "recombinant polypeptides or peptides" it is to be understood that it relates to any molecule having 
a polypeptidic chain liable to be produced by genetic engineering, through transcription and translation, of a 
corresponding DNA sequence under the control of appropriate regulation elements within an efficient 
cellular host. Consequently, the expression "recombinant polypeptides" such as is used herein does not 
exclude the possibility for the polypeptides to comprise other groups, such as glycosylated groups. 

The term "recombinant" indeed involves the fact that the polypeptide has been produced by genetic 
engineering, particularly because it results from the expression in a cellular host of the corresponding 
nucleic acid sequences which have previously been introduced into the expression vector used in said host. 
Nevertheless, it must be understood that this expression does not exclude the possibility for the 
polypeptide to be produced by a different process, for instance by classical chemical synthesis according 
to methods used in the protein synthesis or by proteolytic cleavage of larger molecules. 
The expression "biologically pure" or "biological purity" means on the one hand a grade of purity such 
that the recombinant polypeptide can be used for the production of vaccinating compositions and on the 
other hand the absence of contaminants, more particularly of natural contaminants. 

Tuberculosis remains a major disease in developing countries. The situation is dramatic in some 
countries, particularly where high incidence of tuberculosis among AIDS patients represents a new source 
of dissemination of the disease. 

Tuberculosis is a chronic infectious disease in which cell-mediated immune mechanisms play an 
essential role both for protection against and control of the disease. 

Despite BCG vaccination, and some effective drugs, tuberculosis remains a major global problem. Skin 
testing with tuberculin PPD (purified protein derivative), largely used for screening of the disease, is poorly 
specific, due to cross-reactivity with other pathogenic or environmental saprophytic mycobacteria. 

Moreover, tuberculin PPD when used in serological tests (ELISA) does not make it possible to discriminate 
between patients who have been vaccinated with BCG, or those who have been primo-infected, and those 
who are developing evolutive tuberculosis and for whom an early and rapid diagnosis would be necessary. 
A protein with a molecular weight of 32 kDa has been purified (9) from zinc-deficient Mycobacterium 
bovis BCG culture filtrate (8). This 32-kDa protein of M. bovis BCG has been purified from Sauton zinc-deficient 
culture filtrate of M. bovis BCG using successively hydrophobic chromatography on Phenyl-Sepharose, 
ion exchange on DEAE-Sephacel and molecular sieving on Sephadex G-100. The final 
preparation has been found to be homogeneous as based on several analyses. This P32 protein is a 
constituent of BCG cells grown in normal conditions. It represents about 3% of the soluble fraction of a 
cellular extract, and appears as the major protein released in normal Sauton culture filtrate. This protein has 
been found to have a molecular weight of 32 000 by SDS-polyacrylamide gel electrophoresis and by 
molecular sieving. 

The NH2-terminal amino acid sequence of the 32-kDa protein of M. bovis BCG (Phe-Ser-Arg-Pro-Gly- 
Leu) is identical to that reported for the MPB 59 protein purified from M. bovis BCG substrain Tokyo (34). 

Purified P32 of M. bovis BCG has been tested by various cross immunoelectrophoresis techniques, and 
has been shown to belong to the antigen 85 complex in the reference system for BCG antigens. It has been 
more precisely identified as antigen 85A in the Closs reference system for BCG antigens (7). 

Increased levels of immunoglobulin G antibodies towards the 32-kDa protein of M. bovis BCG could be 
detected in 70% of tuberculous patients (30). 

Furthermore, the 32-kDa protein of M. bovis BCG induces specific lymphoproliferation and interferon-γ 
(IFN-γ) production in peripheral blood leucocytes from patients with active tuberculosis (12) and PPD-positive 
healthy subjects. Recent findings indicate that the amount of 32-kDa protein of M. bovis BCG-induced 
IFN-γ in BCG-sensitized mouse spleen cells is under probable H-2 control (13). Finally, the high 
affinity of mycobacteria for fibronectin is related to proteins of the BCG 85 antigen complex (1). 
Matsuo et al. (17) recently cloned the gene encoding the antigen α, a major protein secreted by BCG 
(substrain Tokyo) and highly homologous to MPB 59 antigen in its NH2-terminal amino acid sequence, and 
even identical for its first 6 amino acids: Phe-Ser-Arg-Pro-Gly-Leu. 

This gene was cloned by using a nucleotide probe homologous to the N-terminal amino acid sequence 
of antigen α, purified from M. tuberculosis as described in Tasaka, H. et al., 1983. "Purification and 
antigenic specificity of alpha protein (Yoneda and Fukui) from Mycobacterium tuberculosis and 
Mycobacterium intracellulare". Hiroshima J. Med. Sci. 32, 1-8. 

The presence of antigens of around 30-32-kDa, named antigen 85 complex, has been revealed from 
electrophoretic patterns of proteins originating from culture media of mycobacteria, such as Mycobacterium 
tuberculosis. By immunoblotting techniques, it has been shown that these antigens cross-react with rabbit 
sera raised against the 32-kDa protein of BCG (8). 

A recent study reported on the preferential humoral response to a 30-kDa and a 31-kDa antigen in 
lepromatous leprosy patients, and to a 32-kDa antigen in tuberculoid leprosy patients (24). 

It has also been found that fibronectin (FN)-binding antigens are prominent components of short-term 
culture supernatants of Mycobacterium tuberculosis. In 3-day-old supernatants, a 30-kilodalton (kDa) protein 
was identified as the major FN-binding molecule. In 21-day-old supernatants, FN was bound to a double 
protein band of around 30 to 32 kDa, as well as to a group of antigens of larger molecular mass (57 to 60 
kDa) (1). 

In other experiments, recombinant plasmids containing DNA from Mycobacterium tuberculosis were 
transformed into Escherichia coli, and three colonies were selected by their reactivity with polyclonal 
antisera to M. tuberculosis. Each recombinant produced 35- and 53-kilodalton proteins (35K and 53K 
proteins, respectively) ("Expression of Proteins of Mycobacterium tuberculosis in Escherichia coli and 
Potential of Recombinant Genes and Proteins for Development of Diagnostic Reagents", Mitchell L. Cohen 
et al., Journal of Clinical Microbiology, July 1987, p. 1176-1180). 

According to the various results known to date, the physico-chemical characteristics of the antigen P32 of 
Mycobacterium tuberculosis are not precise and, furthermore, are insufficient to enable its unambiguous 
identification, as well as the characterization of its structural and functional elements. 

Moreover, the pathogenicity and the potentially infectious property of M. tuberculosis have hampered 
research aimed at identifying, purifying and characterizing the constituents as well as the secretion products of 
this bacterium. 

An aspect of the invention is to provide recombinant polypeptides which can be used as purified 
antigens for the detection and control of tuberculosis. 

Another aspect of the invention is to provide nucleic acids coding for the peptidic chains of biologically 
pure recombinant polypeptides which enable their preparation on a large scale. 

Another aspect of the invention is to provide antigens which can be used in serological tests as an in 
vitro rapid diagnostic of tuberculosis. 

Another aspect of the invention is to provide a rapid in vitro diagnostic means for tuberculosis, making 
it possible to discriminate between patients suffering from an evolutive tuberculosis and those who have been 
vaccinated with BCG or who have been primo-infected. 

Another aspect of the invention is to provide nucleic probes which can be used as in vitro diagnostic 
reagent for tuberculosis, as well as in vitro diagnostic reagent for identifying M. tuberculosis from other 
strains of mycobacteria. 

The recombinant polypeptides of the invention contain in their polypeptidic chain one at least of the 
following amino acid sequences: 

- the one extending from the extremity constituted by amino acid at position (-29) to the extremity 
constituted by amino acid at position (-1) represented on fig. 3a and fig. 3b, or 

- the one extending from the extremity constituted by amino acid at position (12) to the extremity 
constituted by amino acid at position (31) represented on fig. 3a and fig. 3b, or 

- the one extending from the extremity constituted by amino acid at position (36) to the extremity 
constituted by amino acid at position (55) represented on fig. 3a and fig. 3b, or 

- the one extending from the extremity constituted by amino acid at position (77) to the extremity 
constituted by amino acid at position (96) represented on fig. 3a and fig. 3b, or 

- the one extending from the extremity constituted by amino acid at position (101) to the extremity 
constituted by amino acid at position (120) represented on fig. 3a and fig. 3b, or 

- the one extending from the extremity constituted by amino acid at position (175) to the extremity 
constituted by amino acid at position (194) represented on fig. 3a and fig. 3b, or 

- the one extending from the extremity constituted by amino acid at position (211) to the extremity 
constituted by amino acid at position (230) represented on fig. 3a and fig. 3b, or 
- the one extending from the extremity constituted by amino acid at position (275) to the extremity 
constituted by amino acid at position (294) represented on fig. 3a and fig. 3b, 

and the peptidic sequences resulting from the modification by substitution and/or by addition and/or by 
deletion of one or several amino acids in so far as this modification does not alter the following properties: 
the polypeptides react with rabbit polyclonal antiserum raised against the protein of 32-kDa of M. bovis 
BCG culture filtrate, and/or 

react selectively with human sera from tuberculosis patients and particularly patients developing an 
evolutive tuberculosis at an early stage, 

and/or react with the amino acid sequence extending from the extremity constituted by amino acid at 
position (1), to the extremity constituted by amino acid at position (294) represented on fig. 3a and fig. 3b. 
On figures 3a and 3b: 

- X represents G or GG, 

- Y represents C or CC, 

- Z represents C or G, 

- W represents C or G and is different from Z, 

- X represents C or CG, 

- L represents G or CG, 

- a1-b1 represents ALA-ARG or GLY-ALA-ALA, 

- a2 represents arg or gly, 

- a3-b3-c3-d3-e3-f3 represents his-trp-val-pro-arg-pro or ala-leu-gly-ala, 

- a4 represents pro or pro-asn-thr, 

- a5 represents pro or ala-pro. 
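
The position ranges listed above for figs. 3a and 3b use a signed numbering in which negative positions fall before the mature protein and positive positions inside it. As an illustration only, the following sketch shows how such a range could be mapped onto a precursor sequence held as a string. It assumes that there is no position 0 (position -1 is immediately followed by position 1); the helper name `segment`, the parameter `leader_len` and the three-residue leader "MKL" are hypothetical, and only the six mature residues Phe-Ser-Arg-Pro-Gly-Leu (FSRPGL) are taken from the text.

```python
def segment(precursor: str, leader_len: int, start: int, end: int) -> str:
    """Return residues start..end (inclusive) in signed figure numbering.

    Assumption: position -1 is the last residue before the mature protein,
    position 1 is the first mature residue, and there is no position 0.
    """
    def to_index(pos: int) -> int:
        if pos == 0:
            raise ValueError("position 0 does not exist in this numbering")
        # Negative positions count back from the end of the leader;
        # positive positions are 1-based within the mature protein.
        return leader_len + pos if pos < 0 else leader_len + pos - 1

    i, j = to_index(start), to_index(end)
    if not (0 <= i <= j < len(precursor)):
        raise ValueError("requested range lies outside the sequence")
    return precursor[i:j + 1]


# Toy precursor: hypothetical 3-residue leader + N-terminus quoted in the text.
toy = "MKL" + "FSRPGL"
print(segment(toy, 3, -3, -1))  # leader region
print(segment(toy, 3, 1, 6))    # mature N-terminus
```

With the real sequences of the figures, a call such as `segment(seq, leader_len, 12, 31)` would return the 20-residue peptide of the second listed range, provided the numbering assumption holds.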

The recombinant polypeptides of the invention contain in their polypeptidic chain one at least of the 
following amino acid sequences: 
- the one extending from the extremity constituted by amino acid at position (-29) to the extremity 
constituted by amino acid at position (-1) represented on fig. 4a and fig. 4b, or 

- the one extending from the extremity constituted by amino acid at position (12) to the extremity 
constituted by amino acid at position (31) represented on fig. 4a and fig. 4b, or 

- the one extending from the extremity constituted by amino acid at position (36) to the extremity 
constituted by amino acid at position (55) represented on fig. 4a and fig. 4b, or 

- the one extending from the extremity constituted by amino acid at position (77) to the extremity 
constituted by amino acid at position (96) represented on fig. 4a and fig. 4b, or 

- the one extending from the extremity constituted by amino acid at position (101) to the extremity 
constituted by amino acid at position (120) represented on fig. 4a and fig. 4b, or 

- the one extending from the extremity constituted by amino acid at position (175) to the extremity 
constituted by amino acid at position (194) represented on fig. 4a and fig. 4b, or 

- the one extending from the extremity constituted by amino acid at position (211) to the extremity 
constituted by amino acid at position (230) represented on fig. 4a and fig. 4b, or 

- the one extending from the extremity constituted by amino acid at position (275) to the extremity 
constituted by amino acid at position (294) represented on fig. 4a and fig. 4b, 

and the peptidic sequences resulting from the modification by substitution and/or by addition and/or by 
deletion of one or several amino acids in so far as this modification does not alter the following properties : 
the polypeptides react with rabbit polyclonal antiserum raised against the protein of 32-kDa of M. bovis 
BCG culture filtrate, and/or 

react selectively with human sera from tuberculosis patients and particularly patients developing an 
evolutive tuberculosis at an early stage, 

and/or react with the amino acid sequence extending from the extremity constituted by amino acid at 
position (1), to the extremity constituted by amino acid at position (294) represented on fig. 4a and fig. 4b. 
The recombinant polypeptides of the invention contain in their polypeptidic chain one at least of the 
following amino acid sequences: 

- the one extending from the extremity constituted by amino acid at position (-30) to the extremity 
constituted by amino acid at position (-1) represented on fig. 5, or 

- the one extending from the extremity constituted by amino acid at position (12) to the extremity 
constituted by amino acid at position (31) represented on fig. 5, or 

- the one extending from the extremity constituted by amino acid at position (36) to the extremity 
constituted by amino acid at position (55) represented on fig. 5, or 

- the one extending from the extremity constituted by amino acid at position (77) to the extremity 
constituted by amino acid at position (96) represented on fig. 5, or 



- the one extending from the extremity constituted by amino acid at position (101) to the extremity 
constituted by amino acid at position (120) represented on fig. 5, or 

- the one extending from the extremity constituted by amino acid at position (175) to the extremity 
constituted by amino acid at position (194) represented on fig. 5, or 

- the one extending from the extremity constituted by amino acid at position (211) to the extremity 
constituted by amino acid at position (230) represented on fig. 5, or 

- the one extending from the extremity constituted by amino acid at position (275) to the extremity 
constituted by amino acid at position (295) represented on fig. 5, 

and the peptidic sequences resulting from the modification by substitution and/or by addition and/or by 
deletion of one or several amino acids in so far as this modification does not alter the following properties : 
the polypeptides react with rabbit polyclonal antiserum raised against the protein of 32-kDa of M. bovis 
BCG culture filtrate, and/or 

react selectively with human sera from tuberculosis patients and particularly patients developing an 
evolutive tuberculosis at an early stage, 

and/or react with the amino acid sequence extending from the extremity constituted by amino acid at 
position (1), to the extremity constituted by amino acid at position (295) represented on fig. 5. 

Advantageous polypeptides of the invention are characterized by the fact that they react with rabbit 
polyclonal antiserum raised against the protein of 32-kDa of M. bovis BCG culture filtrate, hereafter 
designated by "P32 protein of BCG". 

Advantageous polypeptides of the invention are characterized by the fact that they selectively react with 
human sera from tuberculous patients and particularly patients developing an evolutive tuberculosis at an 
early stage. 

Hereafter is given, in a non limitative way a process for preparing rabbit polyclonal antiserum raised 
against the P32 protein of BCG and a test for giving evidence of the reaction between the polypeptides of 
the invention and said rabbit polyclonal antiserum raised against the P32 protein of BCG. 

1) process for preparing rabbit polyclonal antiserum raised against the P32 protein of BCG: 
Purified P32 protein of BCG from culture filtrate is used. 



a) Purification of protein P32 of BCG 

P32 protein can be purified as follows : 
The bacterial strains used are M. bovis BCG substrains 1173P2 (Pasteur Institute, Paris) and GL2 (Pasteur 
Institute, Brussels). 

The culture of bacteria is obtained as follows : 
Mycobacterium bovis BCG is grown as a pellicle on Sauton medium, at 37.5 °C for 14 days. As the medium 
is prepared with distilled water, zinc sulfate is added to the final concentration of 5 µM (normal Sauton 
medium) (De Bruyn J., Weckx M., Beumer-Jochmans M.-P. Effect of zinc deficiency on Mycobacterium 
tuberculosis var. bovis (BCG). J. Gen. Microbiol. 1981; 124:353-7). When zinc-deficient medium is 
needed, zinc sulfate is omitted. 

The filtrates from zinc deficient cultures are obtained as follows : 
The culture medium is clarified by decantation. The remaining bacteria are removed by filtration through 
a Millipak 100 filter unit (Millipore Corp., Bedford, Mass.). When used for purification, the filtrate is adjusted to 
20 mM in phosphate, 450 mM in NaCl, 1 mM in EDTA, and the pH is brought to 7.3 with 5 M HCl before 
sterile filtration. 
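
By way of illustration, the amount of salt needed for the adjustment described above can be estimated as follows. This is a rough sketch, not part of the described process: it assumes that the filtrate initially contains a negligible amount of NaCl, that dissolving the salt does not change the volume appreciably, and it uses the 4-liter culture volume mentioned in the purification step. The function name `grams_needed` is hypothetical.

```python
# Standard molar mass of NaCl in g/mol.
NACL_MOLAR_MASS = 58.44


def grams_needed(target_mM: float, volume_l: float, molar_mass: float) -> float:
    """Mass of solute (g) to reach target_mM in volume_l.

    Assumption: the solute is absent from the starting solution and the
    volume change on dissolution is ignored.
    """
    return target_mM / 1000.0 * volume_l * molar_mass


nacl_g = grams_needed(450, 4.0, NACL_MOLAR_MASS)
print(f"NaCl for 4 L at 450 mM: {nacl_g:.1f} g")
```

The same helper could be applied to the phosphate and EDTA adjustments once the molar mass of the salt form actually used is known; those forms are not specified in the text.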

The protein analysis is carried out by polyacrylamide gel electrophoresis. Sodium dodecyl sulfate 
polyacrylamide gel electrophoresis (SDS-PAGE) is done on 13% (w/v) acrylamide-containing gels as 
described by Laemmli U.K. (Cleavage of structural proteins during the assembly of the head of 
bacteriophage T4. Nature 1970; 227:680-5). The gels are stained with Coomassie Brilliant Blue R-250 and, for 
quantitative analysis, scanned at 595 nm with a DU8 Beckman spectrophotometer. For control of purity the 
gel is revealed with silver stain (Bio-Rad Laboratories, Richmond, Calif.). 

The purification step of P32 is carried out as follows: 
Except for hydrophobic chromatography on Phenyl-Sepharose, all buffers contain Tween 80 (0.005% final 
concentration). The pH is adjusted to 7.3 before sterilization. All purification steps are carried out at +4 °C. 
Elutions are followed by recording the absorbance at 280 nm. The fractions containing proteins are 
analysed by SDS-PAGE. 

(i) The treated filtrate from a 4-liter zinc-deficient culture, usually containing 125 to 150 mg protein per 
liter, is applied to a column (5.0 by 5.0 cm) of Phenyl-Sepharose CL-4B (Pharmacia Fine Chemicals, 
Uppsala, Sweden), which is previously equilibrated with 20 mM phosphate buffer (PB) containing 0.45 M 
NaCl and 1 mM EDTA, at a flow rate of 800 ml per hour. The gel is then washed with one column 
volume of the same buffer to remove unfixed material and successively with 300 ml of 20 mM and 4 mM 
PB and 10% ethanol (v/v). The P32 appears in the fraction eluted with 10% ethanol. 
(ii) After the phosphate concentration of this fraction has been brought to 4 mM, it is applied to a column 
(2.6 by 10 cm) of DEAE-Sephacel (Pharmacia Fine Chemicals), which is equilibrated with 4 mM PB. 
After washing with the equilibrating buffer the sample is eluted with 25 mM phosphate at a flow rate of 
50 ml per hour. The eluate is concentrated in a 202 Amicon stirred cell equipped with a PM 10 
membrane (Amicon Corp., Lexington, Mass.). 

(iii) The concentrated material is submitted to 4 mg of P32 protein of BCG (soluble extract) or molecular 
sieving on a Sephadex G-100 (Pharmacia) column (2.6 by 45 cm) equilibrated with 50 mM PB, at a flow 
rate of 12 ml per hour. The fractions of the peak giving one band in SDS-PAGE are pooled. The purity of 
the final preparation obtained is controlled by SDS-PAGE followed by silver staining and by molecular 
sieving on a Superose 12 (Pharmacia) column (12.0 by 30 cm) equilibrated with 50 mM PB containing 
0.005% Tween 80 at a flow rate of 0.2 ml/min in the Fast Protein Liquid Chromatography system 
(Pharmacia). Elution is followed by recording the absorbance at 280 nm and 214 nm. 
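
Since step (i) washes the gel with "one column volume", it may help to see what the stated column dimensions imply. The sketch below is illustrative only: it assumes that each "a by b cm" figure is given as inner diameter by bed height, treats the bed as a plain cylinder, and the function name `bed_volume_ml` is hypothetical. The dimensions and flow rates are those quoted in steps (i) to (iii).

```python
import math


def bed_volume_ml(diameter_cm: float, height_cm: float) -> float:
    """Volume of a cylindrical bed in ml (1 cm^3 = 1 ml)."""
    return math.pi * (diameter_cm / 2) ** 2 * height_cm


# (diameter cm, height cm, flow rate ml/h), read as diameter-by-height,
# which is an assumption about the notation used in the text.
columns = {
    "Phenyl-Sepharose CL-4B": (5.0, 5.0, 800.0),
    "DEAE-Sephacel": (2.6, 10.0, 50.0),
    "Sephadex G-100": (2.6, 45.0, 12.0),
}

for name, (diameter, height, flow_ml_per_h) in columns.items():
    volume = bed_volume_ml(diameter, height)
    # Time for one bed volume of buffer to pass at the stated flow rate.
    minutes = volume / flow_ml_per_h * 60
    print(f"{name}: {volume:.1f} ml bed, {minutes:.0f} min per column volume")
```

Under these assumptions the Phenyl-Sepharose bed is just under 100 ml, so the one-column-volume wash at 800 ml per hour takes only a few minutes, whereas the Sephadex G-100 run is far slower.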

b) Preparation of rabbit polyclonal antiserum raised against the P32 protein of BCG : 

400 µg of purified P32 protein of BCG per ml physiological saline are mixed with one volume of 
incomplete Freund's adjuvant. The material is homogenized and injected intradermally in 50 µl doses 
delivered at 10 sites in the back of the rabbits, at 0, 4, 7 and 8 weeks (adjuvant is replaced by the diluent 
for the last injection). One week later, the rabbits are bled and the sera tested for antibody level before 
being distributed in aliquots and stored at -80 °C. 
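
The quantities in the immunization schedule above translate into a protein dose per rabbit as sketched below. This is a back-of-the-envelope illustration, assuming that "one volume" means an equal volume (a 1:1 dilution of the 400 µg/ml solution) and that the final, adjuvant-free injection is diluted in the same proportion. All variable names are hypothetical.

```python
STOCK_UG_PER_ML = 400.0      # purified P32 in physiological saline
DOSE_VOLUME_UL = 50.0        # volume injected per site
SITES_PER_SESSION = 10
SESSIONS = 4                 # weeks 0, 4, 7 and 8

# Assumption: mixing with one (equal) volume halves the concentration.
emulsion_ug_per_ml = STOCK_UG_PER_ML / 2

ug_per_site = emulsion_ug_per_ml * DOSE_VOLUME_UL / 1000.0
ug_per_session = ug_per_site * SITES_PER_SESSION
ug_total = ug_per_session * SESSIONS

print(f"{ug_per_site:.0f} ug per site, {ug_per_session:.0f} ug per session, "
      f"{ug_total:.0f} ug per rabbit in total")
```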

2) test for giving evidence of the reaction between the polypeptides of the invention and said rabbit 
polyclonal antiserum raised against the P32 protein of BCG: 

The test used is an ELISA test; the ELISA for antibody determination is based on the method of 
Engvall and Perlmann (Engvall, E., and P. Perlmann. 1971. Enzyme-linked immunosorbent assay (ELISA). 
Quantitative assay of immunoglobulin G. Immunochemistry 8:871-874). 

Immulon Microelisa plates (Dynatech, Kloten, Switzerland) are coated by adding to each well 1 µg of 
one of the polypeptides of the invention in 100 µl Tris hydrochloride buffer 50 mM (pH 8.2). After incubation 
for 2 h at 27 °C in a moist chamber, the plates are kept overnight at 4 °C. They are washed four times with 
0.01 M phosphate-buffered saline (pH 7.2) containing 0.05% Tween 20 by using a Titertek microplate 
washer (Flow Laboratories, Brussels, Belgium). Blocking is done with 0.5% gelatin in 0.06 M carbonate 
buffer (pH 9.6) for 1 h. Wells are then washed as before, and 100 µl of above mentioned serum diluted in 
phosphate-buffered saline containing 0.05% Tween 20 and 0.5% gelatin is added. According to the results 
obtained in preliminary experiments, the working dilutions are set at 1:200 for IgG, 1:20 for IgA and 1:80 for 
IgM determinations. Each dilution is run in duplicate. After 2 h of incubation and after the wells are washed, 
they are filled with 100 µl of peroxidase-conjugated rabbit immunoglobulins directed against human IgG, 
IgA or IgM (Dakopatts, Copenhagen, Denmark), diluted 1:400, 1:400 and 1:1,200, respectively, in 
phosphate-buffered saline containing 0.05% Tween 20 and 0.5% gelatin and incubated for 90 min. After the wash, the 
amount of peroxidase bound to the wells is quantified by using a freshly prepared solution of 
o-phenylenediamine (10 mg/100 ml) and hydrogen peroxide (8 µl of 30% H2O2 per 100 ml) in 0.15 M citrate 
buffer (pH 5.0) as a substrate. The enzymatic reaction is stopped with 8 N H2SO4 after 15 min of 
incubation. The optical density is read at 492 nm with a Titertek Multiskan photometer (Flow Laboratories). 

Wells without sera are used as controls for the conjugates. Each experiment is done by including on 
each plate one negative and two positive reference sera with medium and low antibody levels to correct for 
plate-to-plate and day-to-day variations. The antibody concentrations are expressed as the optical density 
values obtained after correction of the readings according to the mean variations of the reference sera. 
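
The text does not spell out how the readings are corrected against the reference sera, so the following is only one plausible scheme, offered as a hedged sketch rather than the method actually used. It assumes that each positive reference serum has a nominal optical density established beforehand, that the plate factor is the mean ratio of nominal to observed values, and that duplicate wells are averaged before correction. The function names and all numerical values are hypothetical.

```python
from statistics import mean


def correction_factor(nominal: dict, observed: dict) -> float:
    """Mean ratio of nominal to observed OD over the reference sera.

    Assumption: this ratio-based rule is illustrative; the source only
    says readings are corrected 'according to the mean variations of
    the reference sera'.
    """
    return mean(nominal[name] / observed[name] for name in nominal)


def corrected_od(duplicates, factor: float) -> float:
    """Average the duplicate wells, then apply the plate factor."""
    return mean(duplicates) * factor


# Hypothetical values for one plate (not measured data).
nominal = {"medium": 0.80, "low": 0.30}
observed = {"medium": 0.88, "low": 0.33}

factor = correction_factor(nominal, observed)
print(f"plate factor: {factor:.3f}")
print(f"corrected OD: {corrected_od((0.54, 0.56), factor):.3f}")
```

A ratio-based factor was chosen here because it leaves a blank reading of zero unchanged; an additive offset would be an equally defensible reading of the text.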

Hereafter is also given in a non limitative way, a test for giving evidence of the fact that polypeptides of 
the invention are recognized selectively by human sera from tuberculous patients. 
This test is an immunoblotting (Western blotting) analysis, in the case where the polypeptides of the 
invention are obtained by recombinant techniques. This test can also be used for polypeptides of the 
invention obtained by a different preparation process. After sodium dodecyl sulfate-polyacrylamide gel 
electrophoresis, polypeptides of the invention are blotted onto nitrocellulose membranes (Hybond C 
(Amersham)) as described by Towbin et al. (29). The expression of polypeptides of the invention fused to 
β-galactosidase in E. coli Y1089 is visualized by the binding of a polyclonal rabbit anti-32-kDa BCG protein 
serum (1:1,000) or by using a monoclonal anti-β-galactosidase antibody (Promega). The secondary 
antibody (alkaline phosphatase anti-rabbit immunoglobulin G and anti-mouse alkaline phosphatase 
immunoglobulin G conjugates, respectively) is diluted as recommended by the supplier (Promega). 

In order to identify selective recognition of polypeptides of the invention and of fusion proteins of the 
invention by human tuberculous sera, nitrocellulose sheets are incubated overnight with these sera (1:50) 
(after blocking nonspecific protein-binding sites). The human tuberculous sera are selected for their reactivity 
(high or low) against the purified 32-kDa antigen of BCG tested in a dot blot assay as described in 
document (31) of the bibliography hereafter. Reactive areas on the nitrocellulose sheets are revealed by 
incubation with peroxidase-conjugated goat anti-human immunoglobulin G antibody (Dakopatts, 
Copenhagen, Denmark) (1:200) for 4 h, and after repeated washings, color reaction is developed by adding 
peroxidase substrate (α-chloronaphthol) (Bio-Rad Laboratories, Richmond, Calif.) in the presence of 
peroxidase and hydrogen peroxide. 

It goes without saying that the free reactive functions which are present in some of the amino acids, 
which are part of the constitution of the polypeptides of the invention, particularly the free carboxyl groups 
which are carried by the groups Glu or by the C-terminal amino acid on the one hand and/or the free NH2 
groups carried by the N-terminal amino acid or by amino acid inside the peptidic chain, for instance Lys, on 
the other hand, can be modified in so far as this modification does not alter the above mentioned properties 
of the polypeptide. 

The molecules which are thus modified are naturally part of the invention. The above mentioned 
carboxyl groups can be acylated or esterified. 

Other modifications are also part of the invention. Particularly, the amine or ester functions or both of 
terminal amino acids can be themselves involved in the bond with other amino acids. For instance, the 
N-terminal amino acid can be linked to a sequence comprising from 1 to several amino acids corresponding 
to a part of the C-terminal region of another peptide. 

Furthermore, any peptidic sequences resulting from the modification by substitution and/or by addition 
and/or by deletion of one or several amino acids of the polypeptides according to the invention are part of 
the invention in so far as this modification does not alter the above mentioned properties of said 
polypeptides. 

The polypeptides according to the invention can be glycosylated or not, particularly in some of their 
glycosylation sites of the type Asn-X-Ser or Asn-X-Thr, X representing any amino acid. 

Advantageous recombinant polypeptides of the invention contain in their polypeptidic chain, one at least 
of the following amino acid sequences: 

- the one extending from the extremity constituted by amino acid at position (-42) to the extremity 
constituted by amino acid at position (-1) represented on fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by amino acid at position (-47) to the extremity 
constituted by amino acid at position (-1) represented on fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by amino acid at position (-49) to the extremity 
constituted by amino acid at position (-1) represented on fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by amino acid at position (-55) to the extremity 
constituted by amino acid at position (-1) represented on fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by amino acid at position (-59) to the extremity 
constituted by amino acid at position (-1) represented on fig. 3a and fig. 3b. 

Advantageous recombinant polypeptides of the invention contain in their polypeptidic chain, one at least 
of the following amino acid sequences: 

- the one extending from the extremity constituted by amino acid at position (-42) to the extremity 
constituted by amino acid at position (-1) represented on fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by amino acid at position (-47) to the extremity 
constituted by amino acid at position (-1) represented on fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by amino acid at position (-49) to the extremity 
constituted by amino acid at position (-1) represented on fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by amino acid at position (-55) to the extremity 
constituted by amino acid at position (-1) represented on fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by amino acid at position (-59) to the extremity 
constituted by amino acid at position (-1) represented on fig. 4a and fig. 4b. 

Advantageous recombinant polypeptides of the invention contain in their polypeptidic chain, one at least 
of the following amino acid sequences: 
- the one extending from the extremity constituted by amino acid at position (-43) to the extremity 
constituted by amino acid at position (-1) represented on fig. 5. 

Advantageous recombinant polypeptides of the invention contain in their polypeptidic chain, one at least 
of the following amino acid sequences: 

- the one extending from the extremity constituted by amino acid at position (1 ) to the extremity constituted 
10 by amino acid at position (294) represented on fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by amino acid at position (-29) to the extremity 
constituted by amino acid at position (294) represented on fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by amino acid at position (-42) to the extremity 
constituted by amino acid at position (294) represented on fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by amino acid at position (-47) to the extremity
constituted by amino acid at position (294) represented on fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by amino acid at position (-49) to the extremity 
constituted by amino acid at position (294) represented on fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by amino acid at position (-55) to the extremity 
constituted by amino acid at position (294) represented on fig. 3a and fig. 3b,

- the one extending from the extremity constituted by amino acid at position (-59) to the extremity 
constituted by amino acid at position (294) represented on fig. 3a and fig. 3b. 

Advantageous recombinant polypeptides of the invention contain in their polypeptidic chain, one at least 
of the following amino acid sequences: 
- the one extending from the extremity constituted by amino acid at position (1) to the extremity constituted
by amino acid at position (294) represented on fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by amino acid at position (-29) to the extremity 
constituted by amino acid at position (294) represented on fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by amino acid at position (-42) to the extremity 
constituted by amino acid at position (294) represented on fig. 4a and fig. 4b,

- the one extending from the extremity constituted by amino acid at position (-47) to the extremity 
constituted by amino acid at position (294) represented on fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by amino acid at position (-49) to the extremity 
constituted by amino acid at position (294) represented on fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by amino acid at position (-55) to the extremity
constituted by amino acid at position (294) represented on fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by amino acid at position (-59) to the extremity 
constituted by amino acid at position (294) represented on fig. 4a and fig. 4b. 

Advantageous recombinant polypeptides of the invention contain in their polypeptidic chain, one at least
of the following amino acid sequences:

- the one extending from the extremity constituted by amino acid at position (1) to the extremity constituted 
by amino acid at position (295) represented on fig. 5, 

- the one extending from the extremity constituted by amino acid at position (-30) to the extremity 
constituted by amino acid at position (295) represented on fig. 5. 

- the one extending from the extremity constituted by amino acid at position (-43) to the extremity
constituted by amino acid at position (295) represented on fig. 5. 

Other advantageous recombinant polypeptides of the invention consist in one of the following amino 
acid sequences: 

- the one extending from the extremity constituted by amino acid at position (-59) to the extremity 
constituted by amino acid at position (294) represented on fig. 3a and fig. 3b,

- the one extending from the extremity constituted by amino acid at position (-55) to the extremity 
constituted by amino acid at position (294) represented on fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by amino acid at position (-49) to the extremity 
constituted by amino acid at position (294) represented on fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by amino acid at position (-47) to the extremity
constituted by amino acid at position (294) represented on fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by amino acid at position (-42) to the extremity 
constituted by amino acid at position (294) represented on fig. 3a and fig. 3b. 
- the one extending from the extremity constituted by amino acid at position (-29) to the extremity
constituted by amino acid at position (294) represented on fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by amino acid at position (1) to the extremity constituted 
by amino acid at position (294) represented on fig. 3a and fig. 3b. 

Other advantageous recombinant polypeptides of the invention consist in one of the following amino
acid sequences: 

- the one extending from the extremity constituted by amino acid at position (-59) to the extremity 
constituted by amino acid at position (294) represented on fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by amino acid at position (-55) to the extremity 
constituted by amino acid at position (294) represented on fig. 4a and fig. 4b,

- the one extending from the extremity constituted by amino acid at position (-49) to the extremity 
constituted by amino acid at position (294) represented on fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by amino acid at position (-47) to the extremity 
constituted by amino acid at position (294) represented on fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by amino acid at position (-42) to the extremity
constituted by amino acid at position (294) represented on fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by amino acid at position (-29) to the extremity 
constituted by amino acid at position (294) represented on fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by amino acid at position (1) to the extremity constituted
by amino acid at position (294) represented on fig. 4a and fig. 4b.

Other advantageous recombinant polypeptides of the invention consist in one of the following amino 
acid sequences: 

- the one extending from the extremity constituted by amino acid at position (1) to the extremity constituted 
by amino acid at position (295) represented on fig. 5, 

- the one extending from the extremity constituted by amino acid at position (-30) to the extremity
constituted by amino acid at position (295) represented on fig. 5, 

- the one extending from the extremity constituted by amino acid at position (-43) to the extremity 
constituted by amino acid at position (295) represented on fig. 5. 

Other advantageous recombinant polypeptides of the invention consist in one of the following amino 
acid sequences:

- the one extending from the extremity constituted by amino acid at position (-59) to the extremity
constituted by amino acid at position (-1) represented on fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by amino acid at position (-55) to the extremity 
constituted by amino acid at position (-1) represented on fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by amino acid at position (-49) to the extremity
constituted by amino acid at position (-1) represented on fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by amino acid at position (-47) to the extremity 
constituted by amino acid at position (-1) represented on fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by amino acid at position (-42) to the extremity 
constituted by amino acid at position (-1) represented on fig. 3a and fig. 3b,

- the one extending from the extremity constituted by amino acid at position (-29) to the extremity 
constituted by amino acid at position (-1) represented on fig. 3a and fig. 3b. 

Other advantageous recombinant polypeptides of the invention consist in one of the following amino 
acid sequences: 

- the one extending from the extremity constituted by amino acid at position (-59) to the extremity
constituted by amino acid at position (-1) represented on fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by amino acid at position (-55) to the extremity 
constituted by amino acid at position (-1) represented on fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by amino acid at position (-49) to the extremity 
constituted by amino acid at position (-1) represented on fig. 4a and fig. 4b,

- the one extending from the extremity constituted by amino acid at position (-47) to the extremity 
constituted by amino acid at position (-1) represented on fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by amino acid at position (-42) to the extremity 
constituted by amino acid at position (-1) represented on fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by amino acid at position (-29) to the extremity
constituted by amino acid at position (-1) represented on fig. 4a and fig. 4b. 

Other advantageous recombinant polypeptides of the invention consist in one of the following amino 
acid sequences: 
- the one extending from the extremity constituted by amino acid at position (-43) to the extremity
constituted by amino acid at position (-1) represented on fig. 5. 

- the one extending from the extremity constituted by amino acid at position (-30) to the extremity 
constituted by amino acid at position (-1) represented on fig. 5. 

In eukaryotic cells, these polypeptides can be used as signal peptides, whose role is to initiate the
translocation of a protein from its site of synthesis and which are excised during translocation.
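As a purely illustrative sketch of the excision described above, the following Python fragment splits a precursor chain into its signal peptide and its mature part. The function name `split_precursor` and the toy sequence are assumptions introduced here for illustration; they are not taken from the figures.

```python
def split_precursor(precursor, signal_length):
    """Return (signal_peptide, mature_protein) for a precursor chain.

    signal_length is the number of N-terminal residues that are excised,
    e.g. 59 for a signal peptide numbered (-59) to (-1).
    """
    if not 0 <= signal_length <= len(precursor):
        raise ValueError("signal_length must lie within the precursor")
    return precursor[:signal_length], precursor[signal_length:]


# Toy sequence (hypothetical, not from the figures): 9 signal residues
# followed by 10 residues of mature protein.
signal, mature = split_precursor("MKKLLAAAGFSRPGLPVEY", 9)
```

In the numbering used in this text, the residues of `signal` would carry negative positions ending at (-1), and the first residue of `mature` would be position (1).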

Other advantageous peptides of the invention consist in one of the following amino acid sequences:

- the one extending from the extremity constituted by amino acid at position (12) to the extremity 
constituted by amino acid at position (31) represented on fig. 3a and fig. 3b, or 

- the one extending from the extremity constituted by amino acid at position (36) to the extremity
constituted by amino acid at position (55) represented on fig. 3a and fig. 3b, or 

- the one extending from the extremity constituted by amino acid at position (77) to the extremity 
constituted by amino acid at position (96) represented on fig. 3a and fig. 3b, or 

- the one extending from the extremity constituted by amino acid at position (101) to the extremity 
constituted by amino acid at position (120) represented on fig. 3a and fig. 3b, or

- the one extending from the extremity constituted by amino acid at position (175) to the extremity 
constituted by amino acid at position (194) represented on fig. 3a and fig. 3b, or 

- the one extending from the extremity constituted by amino acid at position (211) to the extremity 
constituted by amino acid at position (230) represented on fig. 3a and fig. 3b, or

- the one extending from the extremity constituted by amino acid at position (275) to the extremity
constituted by amino acid at position (294) represented on fig. 3a and fig. 3b. 

Other advantageous peptides of the invention consist in one of the following amino acid sequences:

- the one extending from the extremity constituted by amino acid at position (12) to the extremity 
constituted by amino acid at position (31) represented on fig. 4a and fig. 4b, or 

- the one extending from the extremity constituted by amino acid at position (36) to the extremity
constituted by amino acid at position (55) represented on fig. 4a and fig. 4b, or 

- the one extending from the extremity constituted by amino acid at position (77) to the extremity 
constituted by amino acid at position (96) represented on fig. 4a and fig. 4b, or 

- the one extending from the extremity constituted by amino acid at position (101) to the extremity 
constituted by amino acid at position (120) represented on fig. 4a and fig. 4b, or

- the one extending from the extremity constituted by amino acid at position (175) to the extremity 
constituted by amino acid at position (194) represented on fig. 4a and fig. 4b, or 

- the one extending from the extremity constituted by amino acid at position (211) to the extremity 
constituted by amino acid at position (230) represented on fig. 4a and fig. 4b, or

- the one extending from the extremity constituted by amino acid at position (275) to the extremity
constituted by amino acid at position (294) represented on fig. 4a and fig. 4b. 

Other advantageous peptides of the invention consist in one of the following amino acid sequences:

- the one extending from the extremity constituted by amino acid at position (12) to the extremity 
constituted by amino acid at position (31) represented on fig. 5, or 

- the one extending from the extremity constituted by amino acid at position (36) to the extremity
constituted by amino acid at position (55) represented on fig. 5, or 

- the one extending from the extremity constituted by amino acid at position (77) to the extremity 
constituted by amino acid at position (96) represented on fig. 5, or 

- the one extending from the extremity constituted by amino acid at position (101) to the extremity 
constituted by amino acid at position (120) represented on fig. 5, or

- the one extending from the extremity constituted by amino acid at position (175) to the extremity 
constituted by amino acid at position (194) represented on fig. 5, or

- the one extending from the extremity constituted by amino acid at position (211) to the extremity 
constituted by amino acid at position (230) represented on fig. 5, or 

- the one extending from the extremity constituted by amino acid at position (275) to the extremity
constituted by amino acid at position (295) represented on fig. 5. 
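The peptide ranges listed above are given as inclusive positions counted from amino acid (1) of the mature chain. A minimal Python sketch of how such a range could be cut out of a sequence is given below; the helper `peptide` and the toy sequence are assumptions for illustration only, and the ranges shown are those listed for fig. 3a and fig. 3b.

```python
def peptide(mature, start, end):
    """Return residues start..end (1-based, inclusive) of a mature chain."""
    if not 1 <= start <= end <= len(mature):
        raise ValueError("range must lie within the mature chain")
    return mature[start - 1:end]


# Ranges as listed in the text for fig. 3a and fig. 3b.
RANGES_FIG3 = [(12, 31), (36, 55), (77, 96), (101, 120),
               (175, 194), (211, 230), (275, 294)]

# Number of residues covered by each inclusive range.
lengths = [end - start + 1 for start, end in RANGES_FIG3]

# Toy example (hypothetical sequence, not from the figures).
example = peptide("ABCDEFGHIJ", 3, 5)
```

Under this reading, each of the ranges listed for fig. 3a and fig. 3b covers 20 residues.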

It is to be noted that the above mentioned polypeptides are derived from the expression products of a 
DNA derived from the nucleotide sequence coding for a protein of 32-kDa secreted by Mycobacterium 
tuberculosis as explained hereafter in the examples. 

The invention also relates to the amino acid sequences constituted by the above mentioned polypeptides
and a protein or a heterologous sequence with respect to said polypeptide, said protein or
heterologous sequence comprising for instance from about 1 to about 1000 amino acids. These amino acid
sequences will be called fusion proteins.

In an advantageous fusion protein of the invention, the heterologous protein is β-galactosidase.
Other advantageous fusion proteins of the invention are the ones containing a heterologous protein
resulting from the expression of one of the following plasmids:

pEX1
pEX2
pEX3
pUEX1
pUEX2
pUEX3
pmTNF MPH

The invention also relates to any nucleotide sequence coding for a polypeptide of the invention. 
The invention also relates to nucleic acids comprising nucleotide sequences which hybridize with the 
nucleotide sequences coding for any of the above mentioned polypeptides under the following hybridization
conditions: 

- hybridization and wash medium: 3 X SSC, 20% formamide (1 X SSC is 0.15 M NaCl, 0.015 M sodium
citrate, pH 7.0),

- hybridization temperature (HT) and wash temperature (WT) for the nucleic acids of the invention defined
by x-y, i.e. by the sequence extending from the extremity constituted by the nucleotide at position (x) to the
extremity constituted by the nucleotide at position (y) represented on fig. 3a and fig. 3b.

The above mentioned temperatures are to be considered as approximately ± 5 °C.
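For illustration, the salt concentrations of the 3 X SSC medium follow directly from the 1 X SSC definition given above, and the ± 5 °C tolerance can be expressed as a simple check. The sketch below is hedged accordingly: the function names are assumptions, and the temperatures used in the example are hypothetical placeholders, not values taken from this text.

```python
# 1 X SSC composition as stated in the text, in mol/L.
SSC_1X = {"NaCl": 0.15, "sodium citrate": 0.015}


def ssc_molarity(strength):
    """Return the molar concentrations of an n X SSC medium."""
    return {name: round(conc * strength, 6) for name, conc in SSC_1X.items()}


def within_tolerance(actual_c, nominal_c, tolerance_c=5.0):
    """True if a temperature lies within the stated tolerance of the nominal one."""
    return abs(actual_c - nominal_c) <= tolerance_c


medium = ssc_molarity(3)  # the 3 X SSC hybridization and wash medium

# Hypothetical temperatures, chosen only to exercise the check.
ok = within_tolerance(52.0, 50.0)
too_far = within_tolerance(60.0, 50.0)
```

With these definitions, 3 X SSC corresponds to 0.45 M NaCl and 0.045 M sodium citrate.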

The invention also relates to nucleic acids comprising nucleotide sequences which are complementary 
to the nucleotide sequences coding for any of the above mentioned polypeptides. 

It is to be noted that in the above defined nucleic acids, as well as in the hereafter defined nucleic
acids, the nucleotide sequences which are brought into play are such that T can be replaced by U.
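The two relationships mentioned above, complementarity and the replacement of T by U, can be sketched in a few lines of Python. This is an illustrative sketch only; the function names are assumptions, the complement is reported in the usual 5' to 3' orientation (reverse complement), and the example sequence is a toy one.

```python
# Base-pairing table for the four standard DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the complementary strand, read 5' to 3'."""
    return dna.upper().translate(_COMPLEMENT)[::-1]


def to_rna(dna):
    """Return the same sequence with every T replaced by U."""
    return dna.upper().replace("T", "U")


# Toy sequence (hypothetical, not from the figures).
strand = "ATGC"
complementary = reverse_complement(strand)
rna = to_rna(strand)
```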

A group of preferred nucleic acids of the invention comprises one at least of the following nucleotide 
sequences: 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (182) represented in fig. 3a and fig. 3b,

- the one extending from the extremity constituted by nucleotide at position (273) to the extremity 
constituted by nucleotide at position (359) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (360) to the extremity 
constituted by nucleotide at position (1241) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (1242) to the extremity
constituted by nucleotide at position (1358), wherein N represents one of the five A, T, C, G or I
nucleotides, represented in fig. 3a and fig. 3b, 
or above said nucleotide sequences wherein T is replaced by U, 

or nucleic acids which hybridize with said above mentioned nucleotide sequences or the complementary 
sequences thereof.
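Where a sequence contains the symbol N, standing for any one of A, T, C, G or I as stated above, a candidate sequence can be compared against it position by position. The following Python sketch shows one way to do so with a regular expression; the function names and the toy sequences are assumptions made for illustration.

```python
import re


def pattern_with_n(template):
    """Compile a template in which N matches any one of A, T, C, G or I."""
    parts = ("[ACGTI]" if base == "N" else re.escape(base)
             for base in template.upper())
    return re.compile("".join(parts))


def matches(template, candidate):
    """True if candidate agrees with template at every position."""
    return pattern_with_n(template).fullmatch(candidate.upper()) is not None


# Toy sequences (hypothetical, not from the figures).
inosine_ok = matches("ACNGT", "ACIGT")
adenine_ok = matches("ACNGT", "ACAGT")
mismatch = matches("ACNGT", "ACAGA")
```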

A group of preferred nucleic acids of the invention comprises one at least of the following nucleotide 
sequences: 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (182) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position (273) to the extremity
constituted by nucleotide at position (359) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position (360) to the extremity
constituted by nucleotide at position (1241) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position (1242) to the extremity
constituted by nucleotide at position (1358), wherein N represents one of the five A, T, C, G or I
nucleotides, represented in fig. 4a and fig. 4b,

or above said nucleotide sequences wherein T is replaced by U, 

or nucleic acids which hybridize with said above mentioned nucleotide sequences or the complementary 
sequences thereof. 

A group of preferred nucleic acids of the invention comprises one at least of the following nucleotide
sequences: 

- the one extending from the extremity constituted by nucleotide at position (130) to the extremity 
constituted by nucleotide at position (219) represented in fig. 5, 

- the one extending from the extremity constituted by nucleotide at position (220) to the extremity 
constituted by nucleotide at position (1104) represented in fig. 5,

- the one extending from the extremity constituted by nucleotide at position (1104) to the extremity 
constituted by nucleotide at position (1299), wherein N represents one of the five A, T, C, G or I
nucleotides, represented in fig. 5, 

or above said nucleotide sequences wherein T is replaced by U, 
or nucleic acids which hybridize with said above mentioned nucleotide sequences or the complementary
sequences thereof. 

Other preferred nucleic acids of the invention comprise one at least of the following nucleotide 
sequences: 

- the one extending from the extremity constituted by nucleotide at position (195) to the extremity 
constituted by nucleotide at position (359) represented in fig. 3a and fig. 3b,

- the one extending from the extremity constituted by nucleotide at position (213) to the extremity 
constituted by nucleotide at position (359) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (219) to the extremity 
constituted by nucleotide at position (359) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (183) to the extremity
constituted by nucleotide at position (359) represented in fig. 3a and fig. 3b. 

Other preferred nucleic acids of the invention comprise one at least of the following nucleotide 
sequences: 

- the one extending from the extremity constituted by nucleotide at position (195) to the extremity 
constituted by nucleotide at position (359) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position (213) to the extremity
constituted by nucleotide at position (359) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position (219) to the extremity 
constituted by nucleotide at position (359) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position (183) to the extremity
constituted by nucleotide at position (359) represented in fig. 4a and fig. 4b. 

Another preferred group of nucleic acids of the invention comprises the following nucleotide sequences: 

- the one extending from the extremity constituted by nucleotide at position (360) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 3a and fig. 3b. 

Another preferred group of nucleic acids of the invention comprises the following nucleotide sequences: 

- the one extending from the extremity constituted by nucleotide at position (360) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 4a and fig. 4b. 

According to another advantageous embodiment, nucleic acids of the invention comprise one of the
following sequences:

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (194) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (212) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (218) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted
by nucleotide at position (272) represented in fig. 3a and fig. 3b,

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (359) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (1358) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (1241) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (183) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (183) to the extremity 
constituted by nucleotide at position (1241) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (195) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (195) to the extremity 
constituted by nucleotide at position (1241) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (213) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (213) to the extremity 
constituted by nucleotide at position (1241) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (219) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (219) to the extremity 
constituted by nucleotide at position (1241) represented in fig. 3a and fig. 3b,

- the one extending from the extremity constituted by nucleotide at position (234) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (234) to the extremity 
constituted by nucleotide at position (1241) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (273) to the extremity 
constituted by nucleotide at position (1241) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (273) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 3a and fig. 3b. 

According to another advantageous embodiment, nucleic acids of the invention comprise one of the
following sequences: 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (194) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (212) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (218) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (272) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (359) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted
by nucleotide at position (1358) represented in fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (1241) represented in fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by nucleotide at position 
constituted by nucleotide at position (1358) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position 
constituted by nucleotide at position (1241) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position 
constituted by nucleotide at position (1358) represented in fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by nucleotide at position 
constituted by nucleotide at position (1241) represented in fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by nucleotide at position 
constituted by nucleotide at position (1358) represented in fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by nucleotide at position 
constituted by nucleotide at position (1241) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position 
constituted by nucleotide at position (1358) represented in fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by nucleotide at position
constituted by nucleotide at position (1241) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position 
constituted by nucleotide at position (1358) represented in fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by nucleotide at position 
constituted by nucleotide at position (1241) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position 
constituted by nucleotide at position (1241) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position 
constituted by nucleotide at position (1358) represented in fig. 4a and fig. 4b. 

Preferred nucleic acids of the invention consist in one of the following nucleotide sequences: 

- the one extending from the extremity constituted by nucleotide at position (183) to the extremity 
constituted by nucleotide at position (359) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (195) to the extremity 
constituted by nucleotide at position (359) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (213) to the extremity 
constituted by nucleotide at position (359) represented in fig. 3a and fig. 3b,

- the one extending from the extremity constituted by nucleotide at position (219) to the extremity 
constituted by nucleotide at position (359) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (234) to the extremity 
constituted by nucleotide at position (359) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (273) to the extremity 
constituted by nucleotide at position (359) represented in fig. 3a and fig. 3b. 

Preferred nucleic acids of the invention consist in one of the following nucleotide sequences: 

- the one extending from the extremity constituted by nucleotide at position (183) to the extremity 
constituted by nucleotide at position (359) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position (195) to the extremity 
constituted by nucleotide at position (359) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position (213) to the extremity 
constituted by nucleotide at position (359) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position (219) to the extremity 
constituted by nucleotide at position (359) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position (234) to the extremity 
constituted by nucleotide at position (359) represented in fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by nucleotide at position (273) to the extremity 
constituted by nucleotide at position (359) represented in fig. 4a and fig. 4b. 

These nucleotide sequences can be used as nucleotide signal sequences, coding for the corresponding
signal peptide.
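The nucleotide positions of the signal sequences listed above appear to line up with the amino acid positions used earlier in this text if amino acid (1) of fig. 3a and fig. 3b is taken to begin at nucleotide (360), each amino acid occupying three nucleotides and the numbering having no position 0. The following Python sketch makes that arithmetic explicit. The alignment is an inference from the listed ranges, not a statement read from the figures, and the function names are assumptions.

```python
# Assumed first nucleotide of amino acid (1) in fig. 3a and fig. 3b,
# inferred from the ranges (360)-(1241) and (1)-(294) listed in the text.
MATURE_START_NT = 360


def codon_start(aa_position, mature_start_nt=MATURE_START_NT):
    """First nucleotide of the codon for an amino acid position.

    Positions run ..., -2, -1, 1, 2, ... with no position 0.
    """
    if aa_position == 0:
        raise ValueError("the numbering has no position 0")
    offset = aa_position - 1 if aa_position > 0 else aa_position
    return mature_start_nt + 3 * offset


def codon_end(aa_position, mature_start_nt=MATURE_START_NT):
    """Last nucleotide of the codon for an amino acid position."""
    return codon_start(aa_position, mature_start_nt) + 2


# Start positions of the signal sequences for the listed amino acid positions.
starts = [codon_start(p) for p in (-59, -55, -49, -47, -42, -29)]
```

Under the same assumption, `codon_end(-1)` gives 359 and `codon_end(294)` gives 1241, which matches the end positions used in the lists; for fig. 5 the corresponding start would be nucleotide (220), since the range (220) to (1104) spans 295 codons.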

Preferred nucleic acids of the invention consist in one of the following nucleotide sequences: 

- the one extending from the extremity constituted by nucleotide at position (360) to the extremity 
constituted by nucleotide at position (1241) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (360) to the extremity
constituted by nucleotide at position (1358) represented in fig. 3a and fig. 3b. 

Preferred nucleic acids of the invention consist in one of the following nucleotide sequences: 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (182) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted
by nucleotide at position (194) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (212) represented in fig. 3a and fig. 3b,

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (218) represented in fig. 3a and fig. 3b.

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (272) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (359) represented in fig. 3a and fig. 3b,

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted
by nucleotide at position (1241) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (1358) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (183) to the extremity 
constituted by nucleotide at position (1241) represented in fig. 3a and fig. 3b.

- the one extending from the extremity constituted by nucleotide at position (183) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (195) to the extremity 
constituted by nucleotide at position (1241) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (195) to the extremity
constituted by nucleotide at position (1358) represented in fig. 3a and fig. 3b.

- the one extending from the extremity constituted by nucleotide at position (213) to the extremity 
constituted by nucleotide at position (1241) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (213) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 3a and fig. 3b.

- the one extending from the extremity constituted by nucleotide at position (219) to the extremity 
constituted by nucleotide at position (1241) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (219) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (234) to the extremity
constituted by nucleotide at position (1241) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (234) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 3a and fig. 3b,

- the one extending from the extremity constituted by nucleotide at position (273) to the extremity
constituted by nucleotide at position (1241) represented in fig. 3a and fig. 3b.

- the one extending from the extremity constituted by nucleotide at position (273) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (1242) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 3a and fig. 3b. 

Preferred nucleic acids of the invention consist in one of the following nucleotide sequences:

- the one extending from the extremity constituted by nucleotide at position (360) to the extremity 
constituted by nucleotide at position (1241) represented in fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by nucleotide at position (360) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 4a and fig. 4b. 

Preferred nucleic acids of the invention consist in one of the following nucleotide sequences:

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (182) represented in fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted
by nucleotide at position (194) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted
by nucleotide at position (212) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted
by nucleotide at position (218) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (272) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (359) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted
by nucleotide at position (1241) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (1358) represented in fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by nucleotide at position 
constituted by nucleotide at position (1241) represented in fig. 4a and fig. 4b.

- the one extending from the extremity constituted by nucleotide at position 
constituted by nucleotide at position (1358) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position 
constituted by nucleotide at position (1241) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position
constituted by nucleotide at position (1358) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position
constituted by nucleotide at position (1241) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position 
constituted by nucleotide at position (1358) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position 
constituted by nucleotide at position (1241) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position 
constituted by nucleotide at position (1358) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position
constituted by nucleotide at position (1241) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position 
constituted by nucleotide at position (1358) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position 
constituted by nucleotide at position (1241) represented in fig. 4a and fig. 4b.

- the one extending from the extremity constituted by nucleotide at position 
constituted by nucleotide at position (1358) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position (1242) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 4a and fig. 4b. 

Preferred nucleic acids of the invention consist in one of the following nucleotide sequences:

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (129) represented in fig. 5, 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (219) represented in fig. 5, 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted
by nucleotide at position (1104) represented in fig. 5. 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted
by nucleotide at position (1299) represented in fig. 5. 

- the one extending from the extremity constituted by nucleotide at position (90) to the extremity constituted 
by nucleotide at position (219) represented in fig. 5,

- the one extending from the extremity constituted by nucleotide at position (90) to the extremity constituted 
by nucleotide at position (1299) represented in fig. 5, 

- the one extending from the extremity constituted by nucleotide at position (90) to the extremity constituted 
by nucleotide at position (1104) represented in fig. 5. 

- the one extending from the extremity constituted by nucleotide at position (130) to the extremity
constituted by nucleotide at position (1104) represented in fig. 5. 

- the one extending from the extremity constituted by nucleotide at position (130) to the extremity 
constituted by nucleotide at position (1299) represented in fig. 5,

- the one extending from the extremity constituted by nucleotide at position (220) to the extremity
constituted by nucleotide at position (1299) represented in fig. 5. 

Preferred nucleic acids of the invention consist in one of the following nucleotide sequences: 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (129) represented in fig. 5,

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (219) represented in fig. 5. 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (1104) represented in fig. 5. 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted
by nucleotide at position (1299) represented in fig. 5. 

- the one extending from the extremity constituted by nucleotide at position (90) to the extremity constituted 
by nucleotide at position (219) represented in fig. 5. 

- the one extending from the extremity constituted by nucleotide at position (90) to the extremity constituted
by nucleotide at position (1104) represented in fig. 5.

- the one extending from the extremity constituted by nucleotide at position (90) to the extremity constituted 
by nucleotide at position (1299) represented in fig. 5, 

- the one extending from the extremity constituted by nucleotide at position (130) to the extremity 
constituted by nucleotide at position (219) represented in fig. 5, 

- the one extending from the extremity constituted by nucleotide at position (130) to the extremity
constituted by nucleotide at position (1104) represented in fig. 5, 

- the one extending from the extremity constituted by nucleotide at position (130) to the extremity 
constituted by nucleotide at position (1299) represented in fig. 5, 

- the one extending from the extremity constituted by nucleotide at position (220) to the extremity 
constituted by nucleotide at position (1104) represented in fig. 5,

- the one extending from the extremity constituted by nucleotide at position (220) to the extremity 
constituted by nucleotide at position (1299) represented in fig. 5, 

- the one extending from the extremity constituted by nucleotide at position (1104) to the extremity 
constituted by nucleotide at position (1299) represented in fig. 5. 
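
The position ranges listed above are read here as 1-based and inclusive of both end nucleotides. Under that assumption, a fragment could be cut out of a figure sequence as in the following Python sketch. The function names `fragment` and `fragment_length` and the toy sequence are illustrative only and do not come from the figures.

```python
def fragment(sequence: str, start: int, end: int) -> str:
    """Return nucleotides start..end, 1-based, both ends included."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("positions fall outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return sequence[start - 1:end]


def fragment_length(start: int, end: int) -> int:
    """Number of nucleotides in an inclusive 1-based range."""
    return end - start + 1


toy = "ATGACCGTTGACAGG"  # placeholder, not a sequence of the invention
print(fragment(toy, 4, 9))           # ACCGTT
print(fragment_length(360, 1241))    # 882
```

With this reading, the range (360) to (1241) spans 882 nucleotides, which is a whole number of codons (294).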

The invention also relates to any recombinant nucleic acids containing at least a nucleic acid of the
invention inserted in a heterologous nucleic acid.

The invention relates more particularly to recombinant nucleic acid such as defined, in which the
nucleotide sequence of the invention is preceded by a promoter (particularly an inducible promoter) under
the control of which the transcription of said sequence is liable to be processed and possibly followed by a
sequence coding for transcription termination signals.

The invention also relates to the recombinant nucleic acids in which the nucleic acid sequences coding
for the polypeptide of the invention and possibly the signal peptide, are recombined with control elements
which are heterologous with respect to the ones to which they are normally associated within the bacterial
gene and, more particularly, the regulation elements adapted to control their expression in the cellular host
which has been chosen for their production.

The invention also relates to recombinant vectors, particularly for cloning and/or expression, comprising
a vector sequence, notably of the type plasmid, cosmid or phage, and a recombinant nucleic acid of the
invention, in one of the non essential sites for its replication.

Appropriate vectors for expression of the recombinant antigen are the following ones:

pEX1      pmTNF MPH
pEX2      pIGRI
pEX3
pUEX1
pUEX2
pUEX3

The pEX1, pEX2 and pEX3 vectors are commercially available and can be obtained from Boehringer
Mannheim.

The pUEX1, pUEX2 and pUEX3 vectors are also commercially available and can be obtained from
Amersham.

According to an advantageous embodiment of the invention, the recombinant vector contains, in one of
its non essential sites for its replication, necessary elements to promote the expression of polypeptides
according to the invention in a cellular host and possibly a promoter recognized by the polymerase of the
cellular host, particularly an inducible promoter and possibly a signal sequence and/or an anchor sequence.

According to another additional embodiment of the invention, the recombinant vector contains the
elements enabling the expression by E. coli of a nucleic acid according to the invention inserted in the
vector, and particularly the elements enabling the expression of the gene or part thereof of β-galactosidase.

The invention also relates to a cellular host which is transformed by a recombinant vector according to
the invention, and comprising the regulation elements enabling the expression of the nucleotide sequence
coding for the polypeptide according to the invention in this host.

The invention also relates to a cellular host chosen from among bacteria such as E. coli, transformed
by a vector as above defined, and defined hereafter in the examples, or chosen from among eukaryotic
organisms, such as CHO cells, insect cells, Sf9 cells [Spodoptera frugiperda] infected by the virus AcNPV
(Autographa californica nuclear polyhedrosis virus) containing suitable vectors such as pAc 373, pYM1 or
pVC3, BmN [Bombyx mori] infected by the virus BmNPV containing suitable vectors such as pBE520 or
p89B310.

The invention relates to an expression product of a nucleic acid expressed by a transformed cellular 
host according to the invention. 

The invention also relates to nucleotidic probes, hybridizing with any one of the nucleic acids or with
their complementary sequences, and particularly the probes chosen among the following nucleotidic
sequences gathered in Table 1, and represented in fig. 9.

TABLE 1

Probes A(i), A(ii), A(iii), A(iv) and A(v)
A(i)     CAGCTTGTTGACAGGGTTCGTGGC
A(ii)    GGTTGGTGGCGCCGTCACG
A(iii)   CGTCGCGCGCGTAGTGTGGG
A(iv)    CGGCGCCGTCGGTGGCACGGCGA
A(v)     CGTGGGCGCGGCGCTAGTGTCGG

Probe B
         TCGCCCGCCCTGTACCTG

Probe C
         GCGCTGACGCTGGCGATCTATC

Probe D
         CCGCTGTTGAAGGTCGGGAAG

Probe E
         AAGCCGTCGGATCTGGGTGGCAAC

Probes F(i), F(ii), F(iii) and F(iv)
F(i)     ACGGCACTGGGTGCCACGCCCAAC
F(ii)    ACGCCCAACACCGGGCCCGCCGCA
F(iii)   ACGGGCACTGGGTGCGACGCCCAAC
F(iv)    ACGCCCCAACACCGGGCCCGCGCCCCA

or their complementary nucleotidic sequences.
The hybridization conditions can be the following ones: 

- hybridization and wash medium: 3 X SSC, 20% formamide (1 X SSC is 0.15 M NaCl, 0.015 M sodium
citrate, pH 7.0).

- hybridization temperature (HT) and wash temperature (WT):

Probe     HT and WT (°C)
A(i)      50
A(ii)     50
A(iii)    52
A(iv)     60
A(v)      52
B         48
C         50
D         45
E         52
F(i)      55
F(ii)     59
F(iii)    55
F(iv)     59
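
The salt concentrations of the 3 X SSC medium follow by simple scaling from the 1 X definition given above. The short Python sketch below works this out and holds the per-probe temperatures as a lookup table; the names `SSC_1X`, `ssc_molarity` and `HT_WT_CELSIUS` are illustrative choices, not terms of the text.

```python
# 1 X SSC as defined in the text: 0.15 M NaCl, 0.015 M sodium citrate.
SSC_1X = {"NaCl": 0.15, "sodium citrate": 0.015}


def ssc_molarity(fold: float) -> dict:
    """Molar concentrations of an n-fold SSC solution."""
    return {salt: round(fold * molar, 4) for salt, molar in SSC_1X.items()}


# Hybridization and wash temperatures (same value for both), in Celsius.
HT_WT_CELSIUS = {
    "A(i)": 50, "A(ii)": 50, "A(iii)": 52, "A(iv)": 60, "A(v)": 52,
    "B": 48, "C": 50, "D": 45, "E": 52,
    "F(i)": 55, "F(ii)": 59, "F(iii)": 55, "F(iv)": 59,
}

print(ssc_molarity(3))        # 0.45 M NaCl, 0.045 M sodium citrate
print(HT_WT_CELSIUS["D"])     # 45
```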



These probes might make it possible to differentiate M. tuberculosis from other bacterial strains and in
particular from the following mycobacteria species:

- Mycobacterium marinum, Mycobacterium scrofulaceum, Mycobacterium gordonae, Mycobacterium szulgai,
Mycobacterium intracellulare, Mycobacterium xenopi, Mycobacterium gastri, Mycobacterium
nonchromogenicum, Mycobacterium terrae and Mycobacterium triviale, and more particularly from M. bovis,
Mycobacterium kansasii, Mycobacterium avium, Mycobacterium phlei and Mycobacterium fortuitum.

The invention also relates to DNA or RNA primers which can be used for the synthesis of nucleotidic
sequences according to the invention by PCR (polymerase chain reaction technique), such as described in
US Patents No. 4,683,202 and No. 4,683,195 and European Patent No. 200362.

The invention also relates to any DNA or RNA primer constituted by about 15 to about 25 nucleotides 
of a nucleotide sequence coding for a polypeptide according to the invention. 

The invention also relates to any DNA or RNA primer constituted by about 15 to about 25 nucleotides
liable to hybridize with a nucleotide sequence coding for a polypeptide according to the invention. 

The invention also relates to any DNA or RNA primer constituted by about 15 to about 25 nucleotides 
complementary to a nucleotide sequence coding for a polypeptide according to the invention. 

The sequences which can be used as primers are given in Table 2 hereafter (sequences P1 to P6 or
their complement) and illustrated in fig. 9:

TABLE 2

P1          GAGTACCTGCAGGTGCCGTCGCCGTCGATGGGCCG
P2          ATCAACACCCCGGCGTTCGAGTGGTAC
P2 compl.   GTACCACTCGAACGCCGGGGTGTTGAT
P3          TGCCAGACTTACAAGTGGGA
P3 compl.   TCCCACTTGTAAGTCTGGCA
P4          TCCTGACCAGCGAGCTGCCG
P4 compl.   CGGCAGCTCGCTGGTCAGGA
P5          CCTGATCGGCCTGGCGATGGGTGACGC
P5 compl.   GCGTCACCCATCGCCAGGCCGATCAGG
P6 compl.   GCGCCCCAGTACTCCCAGCTGTGCGT

compl. = complement

The sequences can be combined in twelve different primer sets (given in Table 3) which allow
enzymatic amplification by the polymerase chain reaction (PCR) technique of any of the nucleotide
sequences of the invention, and more particularly the one extending from the extremity constituted by
nucleotide at position 1 to the extremity constituted by nucleotide at position 1358, as well as the nucleotide
sequence of antigen α of BCG (17).
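
The "compl." entries of Table 2 read as reverse complements, written 5' to 3', of the corresponding primers. A minimal Python sketch, assuming plain A/C/G/T sequences, checks this for P2 and P3; the helper name `reverse_complement` is an illustrative choice.

```python
# Translation table mapping each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Complement every base, then reverse so the result reads 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


P2 = "ATCAACACCCCGGCGTTCGAGTGGTAC"
P2_COMPL = "GTACCACTCGAACGCCGGGGTGTTGAT"
P3 = "TGCCAGACTTACAAGTGGGA"
P3_COMPL = "TCCCACTTGTAAGTCTGGCA"

print(reverse_complement(P2) == P2_COMPL)  # True
print(reverse_complement(P3) == P3_COMPL)  # True
```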

The detection of the PCR amplified product can be achieved by a hybridization reaction with an
oligonucleotide sequence of at least 10 nucleotides which is located between PCR primers which have been
used to amplify the DNA.

The PCR products of the nucleotide sequences of the invention can be distinguished from the α-antigen
gene of BCG or part thereof by hybridization techniques (dot-spot, Southern blotting, etc.) with the probes
indicated in Table 3. The sequences of these probes can be found in Table 1 hereabove.

TABLE 3

Primer set                             Detection with probe
1. P1 and the complement of P2         B
2. P1 and the complement of P3         B
3. P1 and the complement of P4         B
4. P1 and the complement of P5         B or C
5. P1 and the complement of P6         B, C, D or E
6. P2 and the complement of P5         C
7. P2 and the complement of P6         C, D or E
8. P3 and the complement of P5         C
9. P3 and the complement of P6         C, D or E
10. P4 and the complement of P5        C
11. P4 and the complement of P6        C, D or E
12. P5 and the complement of P6        D or E



It is to be noted that enzymatic amplification can also be achieved with all oligonucleotides with
sequences of about 15 consecutive bases of the primers given in Table 2. Primers with elongation at the
5'-end or with a small degree of mismatch may not considerably affect the outcome of the enzymatic
amplification if the mismatches do not interfere with the base-pairing at the 3'-end of the primers.
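
The condition on mismatches can be illustrated with a small Python sketch. It assumes that the primer and its binding site are written in the same 5' to 3' sense and have equal length, and it treats the last three bases as "the 3'-end"; that window size is an arbitrary illustration, since the text gives no figure.

```python
def mismatch_positions(primer: str, target_site: str) -> list:
    """0-based positions, counted from the 5' end, where the two differ."""
    if len(primer) != len(target_site):
        raise ValueError("primer and target site must have equal length")
    return [i for i, (p, t) in enumerate(zip(primer, target_site)) if p != t]


def three_prime_end_is_paired(primer: str, target_site: str, window: int = 3) -> bool:
    """True if no mismatch falls within the last `window` bases (assumed size)."""
    cutoff = len(primer) - window
    return all(i < cutoff for i in mismatch_positions(primer, target_site))


P3 = "TGCCAGACTTACAAGTGGGA"
site_5prime_mismatch = "TACCAGACTTACAAGTGGGA"  # hypothetical site, differs at base 2
site_3prime_mismatch = "TGCCAGACTTACAAGTGGGC"  # hypothetical site, differs at last base

print(three_prime_end_is_paired(P3, site_5prime_mismatch))  # True
print(three_prime_end_is_paired(P3, site_3prime_mismatch))  # False
```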

Specific enzymatic amplification of the nucleotide sequences of the invention and not of the BCG gene
can be achieved when the probes (given in Table 1) or their complements are used as amplification
primers.

When the above mentioned probes of Table 1 are used as primers, the primer sets are constituted by
any of the nucleotide sequences (A, B, C, D, E, F) of Table 1 in association with the complement of any
other nucleotide sequence, chosen from A, B, C, D, E or F, it being understood that sequence A means any
of the sequences A(i), A(ii), A(iii), A(iv), A(v) and sequence F, any of the sequences F(i), F(ii), F(iii) and F(iv).
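
The combinations described above can be enumerated mechanically. The Python sketch below assumes, for illustration only, that the probe regions lie in the order A to F along the sequence, so that the first member of each pair is used as such and the second as its complement; the names `VARIANTS`, `region_pairs` and `primer_sets` are hypothetical.

```python
from itertools import combinations, product

# Sequence A stands for any of five variants, F for any of four (Table 1).
VARIANTS = {
    "A": ["A(i)", "A(ii)", "A(iii)", "A(iv)", "A(v)"],
    "B": ["B"],
    "C": ["C"],
    "D": ["D"],
    "E": ["E"],
    "F": ["F(i)", "F(ii)", "F(iii)", "F(iv)"],
}

# Every pair of distinct regions, first member upstream (assumed order A..F).
region_pairs = list(combinations("ABCDEF", 2))

# Expand each region pair into concrete primer pairs.
primer_sets = [
    (forward, "complement of " + reverse)
    for first, second in region_pairs
    for forward, reverse in product(VARIANTS[first], VARIANTS[second])
]

print(len(region_pairs))   # 15
print(len(primer_sets))    # 62
print(primer_sets[0])      # ('A(i)', 'complement of B')
```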

Advantageous primer sets for enzymatic amplification of the nucleotide sequence of the invention can 
be one of the following primer sets given in Table 3bis hereafter: 



TABLE 3 BIS

A(i) or A(ii) or A(iii) or A(iv) or A(v)     and the complement of B
A(i) or A(ii) or A(iii) or A(iv) or A(v)     and the complement of C
B                                            and the complement of C
A(i) or A(ii) or A(iii) or A(iv) or A(v)     and the complement of F
A(i) or A(ii) or A(iii) or A(iv) or A(v)     and the complement of D
A(i) or A(ii) or A(iii) or A(iv) or A(v)     and the complement of E
B                                            and the complement of
B                                            and the complement of
B                                            and the complement of
C                                            and the complement of
C                                            and the complement of
C                                            and the complement of F
D                                            and the complement of E
D                                            and the complement of F
E                                            and the complement of F

A(i), A(ii), A(iii), A(iv), A(v), B, C, D, E and F having the nucleotide sequence indicated in Table 1.

In the case of amplification of a nucleotide sequence of the invention with any of the above mentioned
primer sets defined in Table 3bis hereabove, the detection of the amplified nucleotide sequence can be
achieved by a hybridization reaction with an oligonucleotide sequence of at least 10 nucleotides, said
sequence being located between the PCR primers which have been used to amplify the nucleotide
sequence. An oligonucleotide sequence located between said two primers can be determined from figure 9
where the primers A, B, C, D, E and F are represented by the boxed sequences respectively named probe
region A, probe region B, probe region C, probe region D, probe region E and probe region F.
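
Choosing a detection oligonucleotide "located between" the two primers can be sketched as follows in Python. The template below is a toy construct (primer P3, a made-up interior, then the P4 site), not the sequence of fig. 9, and the function names are illustrative assumptions.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    return seq.translate(COMPLEMENT)[::-1]


def amplicon_interior(template: str, forward_primer: str, reverse_primer: str) -> str:
    """Region strictly between the primer sites on the forward strand.

    reverse_primer is given as used in the reaction, i.e. on the opposite strand.
    """
    start = template.find(forward_primer)
    if start == -1:
        raise ValueError("forward primer site not found")
    interior_start = start + len(forward_primer)
    end = template.find(reverse_complement(reverse_primer), interior_start)
    if end == -1:
        raise ValueError("reverse primer site not found downstream")
    return template[interior_start:end]


def detection_oligos(interior: str, length: int = 10) -> list:
    """All candidate oligonucleotides of the given length inside the interior."""
    return [interior[i:i + length] for i in range(len(interior) - length + 1)]


P3 = "TGCCAGACTTACAAGTGGGA"
P4_COMPL = "CGGCAGCTCGCTGGTCAGGA"
toy_interior = "ACGTTGCAACGGTACCATGCA"          # hypothetical, for illustration
toy_template = P3 + toy_interior + reverse_complement(P4_COMPL)

interior = amplicon_interior(toy_template, P3, P4_COMPL)
candidates = detection_oligos(interior, 10)
print(interior == toy_interior)  # True
print(len(candidates))           # 12
```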

The invention also relates to a kit for enzymatic amplification of a nucleotide sequence by PCR
technique and detection of the amplified nucleotide sequence containing

- one of the PCR primer sets defined in Table 3 and one of the detection probes of the invention,
advantageously the probes defined in Table 1,

or one of the PCR primer sets defined in Table 3bis, and a detection sequence consisting for instance in an
oligonucleotide sequence of at least 10 nucleotides, said sequence being located (fig. 9) between the two
PCR primers constituting the primer set which has been used for amplifying said nucleotide sequence.

The invention also relates to a process for preparing a polypeptide according to the invention
comprising the following steps:

- the culture in an appropriate medium of a cellular host which has previously been transformed by an
appropriate vector containing a nucleic acid according to the invention,

- the recovery of the polypeptide produced by the above said transformed cellular host from the above said
culture medium, and

- the purification of the polypeptide produced, possibly by means of immobilized metal ion affinity
chromatography (IMAC).

The polypeptides of the invention can be prepared according to the classical techniques in the field of
peptide synthesis.

The synthesis can be carried out in homogeneous solution or in solid phase.

For instance, the synthesis technique in homogeneous solution which can be used is the one described
by Houben-Weyl in the book titled "Methoden der organischen Chemie" (Methods of organic chemistry) edited
by E. Wünsch, vol. 15-I and II, Thieme, Stuttgart 1974.

The polypeptides of the invention can also be prepared according to the method described by R.D.
MERRIFIELD in the article titled "Solid phase peptide synthesis" (J. Am. Chem. Soc. 85, 2149-2154).

The invention also relates to a process for preparing the nucleic acids according to the invention. 

A suitable method for chemically preparing the single-stranded nucleic acids of the invention (containing
at most 100 nucleotides) comprises the following steps:

- DNA synthesis using the automatic β-cyanoethyl phosphoramidite method described in Bioorganic
Chemistry 4: 274-325, 1986.

In the case of single-stranded DNA, the material which is obtained at the end of the DNA synthesis can
be used as such.

A suitable method for chemically preparing the double-stranded nucleic acids of the invention (containing
at most 100 bp) comprises the following steps:

- DNA synthesis of one sense oligonucleotide using the automatic β-cyanoethyl phosphoramidite method
described in Bioorganic Chemistry 4: 274-325, 1986, and DNA synthesis of one anti-sense oligonucleotide
using said above-mentioned automatic β-cyanoethyl phosphoramidite method,

- combining the sense and anti-sense oligonucleotides by hybridization in order to form a DNA duplex,

- cloning the DNA duplex obtained into a suitable plasmid vector and recovery of the DNA according to
classical methods, such as restriction enzyme digestion and agarose gel electrophoresis.

A method for the chemical preparation of nucleic acids of length greater than 100 nucleotides - or bp, in
the case of double-stranded nucleic acids - comprises the following steps:

- assembling of chemically synthesized oligonucleotides, provided at their ends with different restriction
sites, the sequences of which are compatible with the succession of amino acids in the natural peptide,
according to the principle described in Proc. Nat. Acad. Sci. USA 80: 7461-7465, 1983,

- cloning the DNA thereby obtained into a suitable plasmid vector and recovery of the desired nucleic acid
according to classical methods, such as restriction enzyme digestion and agarose gel electrophoresis.

The invention also relates to antibodies themselves formed against the polypeptides according to the
invention.

It goes without saying that this production is not limited to polyclonal antibodies.

It also relates to any monoclonal antibody produced by any hybridoma liable to be formed according to
classical methods from splenic cells of an animal, particularly of a mouse or rat, immunized against the
purified polypeptide of the invention on the one hand, and of cells of a myeloma cell line on the other hand,
and to be selected by its ability to produce the monoclonal antibodies recognizing the polypeptide which
has been initially used for the immunization of the animals.

The invention also relates to any antibody of the invention labeled by an appropriate label of the 
enzymatic, fluorescent or radioactive type. 

The peptides which are advantageously used to produce antibodies, particularly monoclonal antibodies,
are the following ones gathered in Table 4:

TABLE 4a (see fig. 4a and 4b)

Amino acid position                          Amino acid position
(NH2-terminal)                               (COOH-terminal)
12        QVPSPSMGRDIKVQFQSGGA               31
36        LYLLDGLRAQDDFSGWDINT               55
77        SFYSDWYQPACRKAGCQTYK               96
101       LTSELPGWLQANRHVKPTGS               120
175       KASDMWGPKEDPAWQRNDPL               194
211       CGNGKPSDLGGNNLPAKFLE               230
275       KPDLQRHWVPRPTPGPPQGA               294

TABLE 4b (see fig. 5)

Amino acid position                          Amino acid position
(NH2-terminal)                               (COOH-terminal)
77        SFYSDWYQPACGKAGCQTYK               96
276       PDLQRALGATPNTGPAPQGA               295

The amino acid sequences are given in the 1-letter code.

Variations of the peptides listed in Table 4 are also possible depending on their intended use. For
example, if the peptides are to be used to raise antisera, the peptides may be synthesized with an extra
cysteine residue added. This extra cysteine residue is preferably added to the amino terminus and
facilitates the coupling of the peptide to a carrier protein which is necessary to render the small peptide
immunogenic. If the peptide is to be labeled for use in radioimmune assays, it may be advantageous to
synthesize the peptide with a tyrosine attached to either the amino or carboxyl terminus to facilitate
iodination. These peptides therefore possess the primary sequence of the peptides listed in Table 4 but
with additional amino acids which do not appear in the primary sequence of the protein and whose sole
function is to confer the desired chemical properties to the peptides.
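
The two peptide variations just described amount to simple edits of the 1-letter sequence. A minimal Python sketch, using the first peptide of Table 4a as input; the function names are illustrative and not taken from the text.

```python
def with_coupling_cysteine(peptide: str) -> str:
    """Add an extra cysteine (C) at the amino terminus for carrier coupling."""
    return "C" + peptide


def with_iodination_tyrosine(peptide: str, terminus: str = "N") -> str:
    """Add a tyrosine (Y) at the amino ('N') or carboxyl ('C') terminus."""
    if terminus == "N":
        return "Y" + peptide
    if terminus == "C":
        return peptide + "Y"
    raise ValueError("terminus must be 'N' or 'C'")


peptide_12_31 = "QVPSPSMGRDIKVQFQSGGA"  # positions 12 to 31, Table 4a

print(with_coupling_cysteine(peptide_12_31))         # CQVPSPSMGRDIKVQFQSGGA
print(with_iodination_tyrosine(peptide_12_31, "C"))  # QVPSPSMGRDIKVQFQSGGAY
```

In both cases the original 20-residue sequence is left unchanged inside the longer peptide.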

The invention also relates to a process for detecting in vitro antibodies related to tuberculosis in a
human biological sample liable to contain them, this process comprising

- contacting the biological sample with a polypeptide or a peptide according to the invention under
conditions enabling an in vitro immunological reaction between said polypeptide and the antibodies which
are possibly present in the biological sample and

- the in vitro detection of the antigen/antibody complex which may be formed.

Preferably, the biological medium is constituted by a human serum.

The detection can be carried out according to any classical process.

By way of example a preferred method brings into play an immunoenzymatic process according to
the ELISA technique, or an immunofluorescent or radioimmunological (RIA) process, or equivalent ones.

Thus the invention also relates to any polypeptide according to the invention labeled by an appropriate
label of the enzymatic, fluorescent or radioactive type.

Such a method for detecting in vitro antibodies related to tuberculosis comprises for instance the
following steps:

- deposit of determined amounts of a polypeptidic composition according to the invention in the wells of a
titration microplate,

- introduction into said wells of increasing dilutions of the serum to be diagnosed,

- incubation of the microplate,

- repeated rinsing of the microplate,

- introduction into the wells of the microplate of labeled antibodies against the blood immunoglobulins,
the labeling of these antibodies being carried out by means of an enzyme which is selected from among
the ones which are able to hydrolyze a substrate by modifying the absorption of the radiation of this latter at
least at a given wave length,

- detection by comparing with a control standard of the amount of hydrolyzed substrate.
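
The final comparison step could, for instance, be reduced to reading an endpoint titer from the dilution series. The Python sketch below assumes that the control standard yields a single absorbance cutoff and that a well counts as positive when its absorbance exceeds that cutoff; both the rule and the readings are hypothetical illustrations, not values from the text.

```python
def endpoint_titer(readings: dict, cutoff: float):
    """Highest dilution factor whose absorbance exceeds the cutoff, or None.

    readings maps a dilution factor (e.g. 100 for 1/100) to an absorbance.
    """
    positive = [dilution for dilution, absorbance in readings.items()
                if absorbance > cutoff]
    return max(positive) if positive else None


# Hypothetical absorbances for increasing dilutions of one serum.
serum_readings = {100: 1.82, 200: 1.10, 400: 0.61, 800: 0.29, 1600: 0.12}
control_cutoff = 0.25  # assumed value derived from the control standard

print(endpoint_titer(serum_readings, control_cutoff))  # 800
```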

The invention also relates to a process for detecting and identifying in vitro antigens of M. tuberculosis 
in a human biological sample liable to contain them, this process comprising: 

- contacting the biological sample with an appropriate antibody of the invention under conditions enabling 
an in vitro immunological reaction between said antibody and the antigens of M. tuberculosis which are 
possibly present in the biological sample and 

- the in vitro detection of the antigen/antibody complex which may be formed. 

Preferably, the biological medium is constituted by sputum, pleural effusion liquid, broncho-alveolar 
washing liquid, urine, biopsy or autopsy material. 

Appropriate antibodies are advantageously monoclonal antibodies directed against the peptides which 
have been mentioned in Table 4. 

The invention also relates to an additional method for the in vitro diagnostic of tuberculosis in a patient 
liable to be infected by Mycobacterium tuberculosis comprising the following steps: 

- the possible previous amplification of the amount of the nucleotide sequences according to the invention, 
liable to be contained in a biological sample taken from said patient by means of a DNA primer set as 
above defined, 

- contacting the above mentioned biological sample with a nucleotide probe of the invention, under 
conditions enabling the production of a hybridization complex formed between said probe and said 
nucleotide sequence, 

- detecting the above said hybridization complex which has possibly been formed. 

To carry out the in vitro diagnostic method for tuberculosis in a patient liable to be infected by 
Mycobacterium tuberculosis as above defined, the following necessary or kit can be used, said necessary 
or kit comprising: 

- a determined amount of a nucleotide probe of the invention, 

- advantageously the appropriate medium for creating a hybridization reaction between the sequence to be 
detected and the above mentioned probe, 

- advantageously, reagents enabling the detection of the hybridization complex which has been formed 
between the nucleotide sequence and the probe during the hybridization reaction. 

The invention also relates to an additional method for the in vitro diagnostic of tuberculosis in a patient 
liable to be infected by Mycobacterium tuberculosis comprising: 

- contacting a biological sample taken from a patient with a polypeptide or a peptide of the invention, under 
conditions enabling an in vitro immunological reaction between said polypeptide or peptide and the 
antibodies which are possibly present in the biological sample and 

- the in vitro detection of the antigen/antibody complex which has possibly been formed. 

To carry out the in vitro diagnostic method for tuberculosis in a patient liable to be infected by 
Mycobacterium tuberculosis, the following necessary or kit can be used, said necessary or kit comprising: 

- a polypeptide or a peptide according to the invention, 

- reagents for making a medium appropriate for the immunological reaction to occur, 

- reagents enabling the detection of the antigen/antibody complex which has been produced by the immunological 
reaction, said reagents possibly having a label, or being liable to be recognized by a labeled reagent, more 
particularly in the case where the above mentioned polypeptide or peptide is not labeled. 

The invention also relates to an additional method for the in vitro diagnostic of tuberculosis in a patient 
liable to be infected by M. tuberculosis, comprising the following steps: 

- contacting the biological sample with an appropriate antibody of the invention under conditions enabling 
an in vitro immunological reaction between said antibody and the antigens of M. tuberculosis which are 
possibly present in the biological sample and 

- the in vitro detection of the antigen/antibody complex which may be formed. 

Appropriate antibodies are advantageously monoclonal antibodies directed against the peptides which 
have been mentioned in Table 4. 
To carry out the in vitro diagnostic method for tuberculosis in a patient liable to be infected by 
Mycobacterium tuberculosis, the following necessary or kit can be used, said necessary or kit comprising: 

- an antibody of the invention, 

- reagents for making a medium appropriate for the immunological reaction to occur, 

- reagents enabling the detection of the antigen/antibody complexes which have been produced by the 
immunological reaction, said reagents possibly having a label or being liable to be recognized by a labeled 
reagent, more particularly in the case where the above mentioned antibody is not labeled. 

An advantageous kit for the diagnostic in vitro of tuberculosis comprises: 

- at least a suitable solid phase system, e.g. a microtiter-plate for deposition thereon of the biological 
sample to be diagnosed in vitro, 

- a preparation containing one of the monoclonal antibodies of the invention, 

- a specific detection system for said monoclonal antibody, 

- appropriate buffer solutions for carrying out the immunological reaction between a test sample and said 
monoclonal antibody on the one hand, and the bonded monoclonal antibodies and the detection system on 
the other hand. 

The invention also relates to a kit, as described above, also containing a preparation of one of the 
polypeptides or peptides of the invention, said antigen of the invention being either a standard (for 
quantitative determination of the antigen of M. tuberculosis which is sought) or a competitor, with respect to 
the antigen which is sought, for the kit to be used in a competition dosage process. 

The invention also relates to an immunogenic composition comprising a polypeptide or a peptide 
according to the invention, in association with a pharmaceutically acceptable vehicle. 

The invention also relates to a vaccine composition comprising among other immunogenic principles 
any one of the polypeptides or peptides of the invention or the expression product of the invention, possibly 
coupled to a natural protein or to a synthetic polypeptide having a sufficient molecular weight so that the 
conjugate is able to induce in vivo the production of antibodies neutralizing Mycobacterium tuberculosis, or 
induce in vivo a cellular immune response by activating M. tuberculosis antigen-responsive T cells. 

The peptides of the invention which are advantageously used as immunogenic principle have one of the 
following sequences: 

TABLE 4a (see fig. 4a and 4b) 

Amino acid                                    Amino acid 
position                                      position 
(NH2-terminal)                                (COOH-terminal) 

 12          QVPSPSMGRDIKVQFQSGGA              31 
 36          LYLLDGLRAQDDFSGWDINT              55 
 77          SFYSDWYQPACRKAGCQTYK              96 
101          LTSELPGWLQANRHVKPTGS             120 
175          KASDMWGPKEDPAWQRNDPL             194 
211          CGNGKPSDLGGNNLPAKFLE             230 
275          KPDLQRHWVPRPTPGPPQGA             294 

TABLE 4b (see fig. 5) 

Amino acid                                    Amino acid 
position                                      position 
(NH2-terminal)                                (COOH-terminal) 

 77          SFYSDWYQPACGKAGCQTYK              96 
276          PDLQRALGATPNTGPAPQGA             299 

The amino acid sequences are given in the one-letter code. 
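
As a transcription aid, the peptide entries of Tables 4a and 4b can be checked for internal consistency between the stated positions and the number of residues listed. The sketch below is only a bookkeeping check under the assumption that each row spans the inclusive range from the NH2-terminal to the COOH-terminal position; the helper name is hypothetical.

```python
# Rows transcribed from Tables 4a and 4b: (NH2-terminal position, sequence,
# COOH-terminal position).
TABLE_4A = [
    (12, "QVPSPSMGRDIKVQFQSGGA", 31),
    (36, "LYLLDGLRAQDDFSGWDINT", 55),
    (77, "SFYSDWYQPACRKAGCQTYK", 96),
    (101, "LTSELPGWLQANRHVKPTGS", 120),
    (175, "KASDMWGPKEDPAWQRNDPL", 194),
    (211, "CGNGKPSDLGGNNLPAKFLE", 230),
    (275, "KPDLQRHWVPRPTPGPPQGA", 294),
]
TABLE_4B = [
    (77, "SFYSDWYQPACGKAGCQTYK", 96),
    (276, "PDLQRALGATPNTGPAPQGA", 299),
]


def span_mismatches(rows):
    """Return the rows whose residue count differs from end - start + 1."""
    return [(start, seq, end) for start, seq, end in rows
            if len(seq) != end - start + 1]


mismatches_4a = span_mismatches(TABLE_4A)
mismatches_4b = span_mismatches(TABLE_4B)
```

With the rows as transcribed here, every entry of Table 4a lists 20 residues for a 20-position span, whereas the second entry of Table 4b lists 20 residues for positions 276 to 299. The sketch only flags this; it does not establish which value, if either, is a transcription error.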

Other characteristics and advantages of the invention will appear in the following examples and the 
figures illustrating the invention. 

Figures 1(A) and 1(B) correspond to the identification of six purified λgt11 M. tuberculosis recombinant 
clones. Figure 1(A) corresponds to the EcoRI restriction analysis of clone 15, clone 16, clone 17, clone 19, 
clone 24 and EcoRI-HindIII digested lambda DNA-molecular weight marker lane (in kilobase pairs) (M) 
(Boehringer). 

Figure 1(B) corresponds to the immunoblotting analysis of crude lysates of E. coli lysogenized with 
clone 15, clone 16, clone 17, clone 19, clone 23 and clone 24. 

Arrow (<-) indicates fusion protein produced by recombinant λgt11-M. tuberculosis clones. Expression 
and immunoblotting were as described above. Molecular weights (indicated in kDa) were estimated by 
comparison with molecular weight marker (High molecular weight-SDS calibration kit, Pharmacia). 

Figure 2 corresponds to the restriction map of the DNA inserts in the λgt11 M. tuberculosis recombinant 
clones 17 and 24 identified with polyclonal anti-32-kDa (BCG) antiserum as above defined and of clones 
By1, By2 and By5 selected by hybridization with a 120 bp EcoRI-Kpn I restriction fragment of clone 17. 

DNA was isolated from λgt11 phage stocks by using the Lambda Sorb Phage Immunoadsorbent, as 
described by the manufacturer (Promega). Restriction sites were located as described above. Some 
restriction sites were deduced from a computer analysis of the nucleotide sequence. 
The short vertical bars (|-|) represent linker derived EcoRI sites surrounding the DNA inserts of 
recombinant clones. The lower part 
represents a magnification of the DNA region containing the antigen of molecular weight of 32-kDa, that has 
been sequenced. Arrows indicate strategies and direction of dideoxy-sequencing: (->) fragment subcloned 
in Bluescribe M13; (<->) fragment subcloned in mp10 and mp11 M13 vectors; (■->) sequence determined 
with the use of a synthetic oligonucleotide. 

Figures 3a and 3b correspond to the nucleotide and amino acid sequences of the general formula of 
the antigens of the invention. 

Figures 4a and 4b correspond to the nucleotide and amino acid sequences of one of the antigens of the 
invention. 

Two groups of sequences resembling the E. coli consensus promoter sequences are boxed and the 
homology to the consensus is indicated by italic bold letters. Roman bold letters represent a putative Shine- 
Dalgarno motif. 

The NH2-terminal amino acid sequence of the mature protein which is underlined with a double line 
happens to be very homologous - 29/32 amino acids - with the one of MPB 59 antigen (34). Five additional 
ATG codons, upstream of the ATG at position 273, are shown (dotted underlined). Vertical arrows (↓) indicate 
the presumed NH2 end of clone 17 and clone 24. The option taken here arbitrarily represents the 59 amino 
acid signal peptide corresponding to ATGiaa- 

Figure 5 corresponds to the nucleotide and amino acid sequences of the antigen of 32-kDa of the 
invention. 

The NH2-terminal amino acid sequence of the mature protein which is underlined with a double line 
happens to be very homologous - 29/32 amino acids - with the one of MPB 59 antigen (34). Vertical arrows 
(↓) indicate the presumed NH2 end of clone 17 and clone 24. 

Figure 6 is the hydropathy pattern of the antigen of the invention of a molecular weight of 32-kDa and of 
the antigen α of BCG (17). 

Figure 7 represents the homology between the amino acid sequences of the antigen of 32-kDa of the 
invention and of antigen α of BCG (revised version). 

Identical amino acids (:), evolutionarily conserved replacement of an amino acid (.), and absence of 
homology ( ) are indicated. Underlined sequence (=) represents the signal peptide, the option taken here 
arbitrarily representing the 43-amino acid signal peptide corresponding to ATGai. Dashes in the sequences 
indicate breaks necessary for obtaining the optimal alignment. 

Figure 8 illustrates the fact that the protein of 32-kDa of the invention is selectively recognized by 
human tuberculous sera. 

Figure 8 represents the immunoblotting with human tuberculous sera, and anti-β-galactosidase 
monoclonal antibody. Lanes 1 to 6: E. coli lysate expressing fusion protein (140 kDa); lanes 7 to 12: unfused 
β-galactosidase (114 kDa). The DNA insert of clone 17 (2.7 kb) was subcloned into pUEX2 and expression of 
fusion protein was induced as described by Bresson and Stanley (4). Lanes 1 and 7 were probed with the 
anti-β-galactosidase monoclonal antibody; lanes 4, 5, 6 and 10, 11, 12 with 3 different human tuberculous 
sera highly responding towards purified protein of the invention of 32-kDa; lanes 2 and 3 and 8 and 9 were 
probed with 2 different low responding sera. 

Figure 9 represents the nucleic acid sequence alignment of the 32-kDa protein gene of M. tuberculosis 
of the invention (upper line), corresponding to the sequence in fig. 5, of the gene of fig. 4a and 4b of the 
invention (middle line), and of the gene for antigen α of BCG (lower line). 

Dashes in the sequence indicate breaks necessary for obtaining optimal alignment of the nucleic acid 
sequence. 

The primer regions for enzymatic amplification are boxed (P1 to P6). 

The specific probe regions are boxed and respectively defined by probe region A, probe region B, 
probe region C, probe region D, probe region E and probe region F. 

It is to be noted that the numbering of nucleotides is different from the numbering of figures 3a and 
3b, and of figure 5, because the nucleotide at position 1 on figure 9 corresponds to nucleotide 234 on 
figure 3a, and corresponds to nucleotide 91 on figure 5. 
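
The stated correspondences between the three numbering schemes amount to fixed offsets, which can be sketched as follows. This is a minimal illustration that assumes the offset stays constant along the sequence, which holds only where no alignment break (dash) of figure 9 intervenes; the function names are hypothetical.

```python
# Offsets derived from the stated correspondences:
# figure 9 position 1 = figure 3a position 234 = figure 5 position 91.
FIG3A_OFFSET = 234 - 1
FIG5_OFFSET = 91 - 1


def fig9_to_fig3a(position: int) -> int:
    """Convert a figure 9 nucleotide position to figure 3a numbering."""
    return position + FIG3A_OFFSET


def fig9_to_fig5(position: int) -> int:
    """Convert a figure 9 nucleotide position to figure 5 numbering."""
    return position + FIG5_OFFSET


def fig5_to_fig3a(position: int) -> int:
    """Convert a figure 5 nucleotide position to figure 3a numbering."""
    return position - FIG5_OFFSET + FIG3A_OFFSET
```

For instance, under this assumption a position numbered 100 on figure 9 would be read as 333 on figure 3a and 190 on figure 5.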

Figure 10a corresponds to the restriction and genetic map of the pIGRI plasmid used in Example IV for 
the expression of the P32 antigen of the invention in E. coli. 

On this figure, underlined restriction sites are unique. 

Figure 10b corresponds to the pIGRI nucleic acid sequence. 

On this figure, the origin of nucleotide stretches used to construct plasmid pIGRI is specified 
hereafter. 

Position 

3422-206 : lambda PL containing EcoRI blunt-MboII blunt fragment of pPL(λ) (Pharmacia) 
207-384  : synthetic DNA sequence 
228-230  : initiation codon ATG of first cistron 
234-305  : DNA encoding amino acids 2 to 25 of mature mouse TNF 
306-308  : stop codon (TAA) first cistron 
311-312  : initiation codon (ATG) second cistron 
385-890  : rrnBT1T2 containing HindIII-SspI fragment from pKK223 (Pharmacia) 
891-3421 : DraI-EcoRI blunt fragment of pAT153 (Bioexcellence) containing the 
           tetracycline resistance gene and the origin of replication. 
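
The four top-level stretches listed above can be checked against the total of 3423 bases reported in the restriction-site analysis of pIGRI. The sketch below assumes inclusive coordinates on a circular molecule, so that the stretch 3422-206 wraps past the origin; the helper name is hypothetical.

```python
def segment_length(start: int, end: int, total: int) -> int:
    """Length of an inclusive segment on a circular sequence of `total` bases."""
    if start <= end:
        return end - start + 1
    # The segment wraps past the origin: count to the last base, then from 1.
    return (total - start + 1) + end


PIGRI_TOTAL = 3423  # total number of bases stated for pIGRI
PIGRI_SEGMENTS = [(3422, 206), (207, 384), (385, 890), (891, 3421)]

pigri_lengths = [segment_length(s, e, PIGRI_TOTAL) for s, e in PIGRI_SEGMENTS]
pigri_covered = sum(pigri_lengths)
```

Under these assumptions the four stretches measure 208, 178, 506 and 2531 bases, which add up to the stated total, so the listed stretches tile the plasmid without gaps or overlaps.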



Table 5 hereafter corresponds to the complete restriction site analysis of pIGRI. 
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***************************** 

* RESTRICTION-SITE ANALYSIS * 



Total number of bases is: 3423. 
Analysis done on the complete sequence. 
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List of cuts by enzyme. 






Ace 


I : 


370 


2765 
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735 


2211 


2868 


2982 


3003 
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1645 
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222 
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1345 
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2329 


2732 
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: 1236 
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LI 


1331 
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20B 
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: 329 


494 


623 


713 


1935 


1977 


2156 
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494 


1935 


1977 
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II 
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2685 
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I 


: 2253 


2487 
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I 


i 15 
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1087 


3048 








Bin 


I* 
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999 


2313 


3035 
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HI 


: 855 


925 


2926 












Bsp 


MI 


I 382 


2361 














Est 


NI 


: 213 


475 


585 


753 


1486 


1499 


1620 


1975 


Cau 


II 


: 4 


663 


716 


1268 


1933 


2159 


2883 


3247 


Cfr 


101 


! 2132 


2486 


2646 


3005 


3014 


3255 






Cfr 


I 


: 1971 


2476 


2884 


3016 


3120 








Cla 


I 


: 3393 

















3287 
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45 



50 
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1335 
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3143 
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3296 
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1130 


1304 
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1774 
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2000 


2062 


2213 


2472 
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2643 


2718 


2870 


2924 


2984 


300 



3158 3186 3318 






EP 0 419 355 A1 








Bin 


PlI : 340 


487 


538 


1019 


1128 


1302 


1402 


1469 


1739 


1772 


196 








1998 


2060 


2211 


2470 


2601 


2641 


2716 


2868 


2922 


2982 


300 








3156 


3184 


3316 


















10 


Hind II I 


! 107 


371 


2766 


















Hind III J 


\ 384 


3386 






















Hinf I 


: 367 


1275 


1671 


1746 


1891 


2112 


2410 


2564 


2784 










II 


: 3 


682 


716 


1077 


1267 


1293 


1440 


1932 


2133 


2159 


239 








2487 


2647 


2723 


2883 


3006 




1010 

J V J V 


1717 

JZ4 / 


17«;k 

JZ30 








tipn 


I : 94 


138 


181 


663 


914 


1900 


7171 

Z X C X 


2975 


1070 


1107 
•J J vZ 




75 


upn 


I* : 6 


























I : 212 
























Mae 


I 


: 364 


899 


1152 


1928 


3187 
















Mae 


II : 274 


698 


944 


1847 


1871 


Z *f u u 


Z -J X u 












Mae 


III 


: 169 


255 


304 


313 


1109 


1775 

X Z Z 


X z o o 


7767 
z z o / 


7md 


1707 
J z uz 


170 
J Z 7 


20 


Hbo 


I 


: 7 




V 7 


QO? 

if \f / 


J O J 




1 OOd 


1 07Q 




7 0*7 7 
Z Z / z 


o ^ o 
z zo 






2318 








JUo / 
















nDO 


II 


: 207 


I £ z 


Q 1 7 

7 1/ 


1 77Q 

X 1 1 y 


loz / 


Od 1 o 
z4 X 7 


Z D 7U 












MDO 


II* 


: 988 


2944 






















nine 


I* 


r 1252 


1436 


3112 


3199 


















nnx 


I 


: 1218 


1542 


1948 


2446 


Z V J u 














25 


nnx 


I* 


: 208 


289 


337 


711 


1467 


1750 


2116 


2143 


2181 


2242 


254 








2811 


3030 


3234 


3794 

J Z 7 t 




• 














nse 


I 


: 179 




ZZ X 


All 
4 J J 




OA 1 




0 0 O 0 


3420 










T 
X 




£ u V X 


lis? 




















Nae 


I 


i 2134 


Z *v O O 




101 1; 


















Nar 


I 


\ 2211 


7ft Aft 


« oz 


10ft 
















30 


Nco 


I 


: 309 
























r*ne 


I 


x 3166 


























III . 


: 166 


Z J V 


1-1 1 


014. 


C C *J 


oca 


Q 0 tt 
J 47 


1649 




1962 


216 




NIa 




2226 


c4h L 


Z J07 


z4oD 


ZD // 


z 7 1 i 


zo57 


2930 


3068 


34 15 






IV 


: 210 






1 i;*7Q 

ID 1 O 


\ C 1 O 

I b i / 




t o ^ a 
1979 


•> Aft ^ 

2093 


2128 


2163 


221 


35 






2530 


2651 


2869 


2893 


2983 


3004 


3042 


3088 


3298 


3341 






Nru 


I ; 


[ 2445 






















Nsp 


BII : 


: 1062 


1 J V / 


ZZ / 0 




















Nsp 


HI ; 


\ 1649 
























Pfl 


HI 


: 293 


70S7 

A V J £ 


7101 
Z X V X 




















Pie 


I : 


375 


1 7 A 




















40 


Pie 


I* : 


1269 


2778 






















Ppu 


MI 


. 1935 


1977 






















Pss 


I ; 


1938 


1980 


2895 




















Pat 


I 


379 
























Hsa 


I : 


210 


3254 






















Sal 


X . ! 


369 


2764 




















45 


Scr 


FI 


4 


213 


475 


585 


683 


716 


753 


1268 


1486 


1499 


162 








1933 


1975 


2159 


2358 


2883 


3247 


3287 












Sdu 


I : 


139 


1335 


1954 


2245 


2832 


2934 


2948 


3143 










Sec 


I ! 


3 


309 


1485 


1968 


2046 


2248 


2881 


2887 


3286 


3300 






Sfa 


NI : 


597 


765 


2392 


2iei 


3178 


3291 














Sfa 


NI* : 


. 1548 


1985 


2380 


3001 


3013 


3202 
















Sph I 
Sso II 



Sty I 
Taq I 
Taq I IB 
Taq I IB* 
Tthlllll 
Tthlllll* 
Xba I 
Xho II 
Xma III 
Xmn I 






2857 






2 


211 
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1547 2149 2290 2765 3078 3393 



993 1004 3040 



Total number of cuts is: 705. 

Sorted list of enzymes by number of cuts. 
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20 


1 Xho II : 


Scr FI : 


18 


i Bgi AI : 


Sso II : 


18 


i Sfa NI* : 


Fnu DII : 


17 


1 Bbv I* : 


Mbo I 


16 


1 Cfr 101 


Dpn I 


\ 16 


i Hga I 


Mnl I* 


: 15 


! Acy I 


Asu I 


: 12 


1 Bin I 


Hae II 


\ 11 


1 Cfr I 


Hae III 


; 11 


! Hga I* 


Rph I 


: 10 


; Mae I 


Bst MI 


: 10 


1 Eco 47III 


Eco RII 


: 10 


; Mnl I 


Sec I 


: 10 


! Mme I* 


Dde I 


: 9 


1 Eco 781 


Binf I 


: 9 


; Nae I 


Hae I 


: 9 


i Bbe I 


Alu I 


9 


1 Bin I* 


Bgi CI 


: 9 


j Nar I 


Mse I 


: 9 


J Fok I* 


Taq I 


: 9 


j Dra II 



8 


1 Tthlllll* : 




1 Ava I 


8 


1 Nsp BII 




[ Taq I IB 


8 


1 Fok I : 




1 Alw NI 


7 


i Pfl MI . : 




1 Dra III 


7 


j Hind II- : 




1 Afl III 


7 


j Dsa I : 




; Cla I 


6 


1 Bsp HI : 




} Eco 571* 


6 


1 Pss I s 




1 Nhe I 


6 


1 Mst I ! 




1 Gsu I* 


6 


j Rgi JII : 


2 


1 Bal I 


6 


i Pie I 


2 


j Eco RV 


6 


1 Mbo II* ! 


2 


1 Sph I 


6 


1 Cvi QI 


: 2 


j Xma III 


5 


1 Acc I 


: 2 


1 Bph I* 


5 


! Bgi I 


: 2 


j Taq IIB* 


5 


} Pie I* 


: 2 


1 Eco 571 


5 


{ Gsu I 


2 


j Kpn I 


5 


1 Ppu MI 


: 2 


1 Xba I 


5 


1 Tthlllll 


: 2 


i Aha III 


5 


; Hind III 


: 2 


j Nru I 




1 Nsp BI 


X 2 


j Ban BI 




1 Bsa I 


: 2 


1 Apa LI 




; Sal I 


: 2 


i Asp 7181 




1 Bbv II 


: 2 


I Eco 311 




1 Bsp HI 


: 2 


1 Nco I 




1 Sty I 


: 2 


j Pst I 




I Eco NI 


: 2 




3 


1 Xmn I 


: 2 

List of non cutting selected enzymes. 

Aat II , Afl II , Apa I , Asu II , Avr II , Bbv II* , Bcl I 
Bgl II , Bsp MI* , Bsp MII , Bss HII , Bst EII , Bst XI , Eco 31I* 
Eco RI , Esp I , Hpa I , Mlu I , Mme I , Nde I , Not I 
Nsi I , Pma CI , Pvu I , Pvu II , Rsr II , Sac I , Sac II 
Sau I , Sca I , Sci I , Sfi I , Sma I , Sna BI , Spe I 
Spl I , Ssp I , Stu I , Taq IIA , Taq IIA* , Tth111 I , Vsp I 
Xca I , Xho I , Xma I 

Total number of selected enzymes which do not cut: 45 
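
For orientation, a listing of this kind (cut positions per enzyme, followed by the enzymes that do not cut) can be produced by searching a sequence for each recognition site. The sketch below is a simplified illustration and not the program used for the tables: it handles only palindromic sites, reports the 1-based position of the first base of each site rather than the cleavage position, and runs on a made-up toy sequence rather than on a plasmid of the invention.

```python
# Well-known palindromic recognition sequences (5' to 3').
SITES = {
    "EcoRI": "GAATTC",
    "HindIII": "AAGCTT",
    "BamHI": "GGATCC",
    "NotI": "GCGGCCGC",
}


def find_sites(seq: str, site: str, circular: bool = True) -> list:
    """Return 1-based start positions of `site` in `seq`."""
    seq = seq.upper()
    n = len(seq)
    # On a circular molecule, append the first len(site) - 1 bases so that
    # sites spanning the origin are found as well.
    text = seq + seq[: len(site) - 1] if circular else seq
    hits = []
    i = text.find(site)
    while i != -1 and i < n:
        hits.append(i + 1)
        i = text.find(site, i + 1)
    return hits


def analyse(seq: str, sites: dict = SITES):
    """Return (cut positions per enzyme, sorted list of non-cutting enzymes)."""
    cuts = {name: find_sites(seq, site) for name, site in sites.items()}
    non_cutters = sorted(name for name, pos in cuts.items() if not pos)
    return cuts, non_cutters


# Toy circular sequence (hypothetical): its EcoRI site spans the origin.
TOY = "TTCAAGCTTGGGGATCCAAAGAA"
toy_cuts, toy_non_cutters = analyse(TOY)
```

Non-palindromic sites would additionally require a search on the reverse-complement strand; that case is deliberately left out of this sketch.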



Figure 11a corresponds to the restriction and genetic map of the pmTNF-MPH plasmid used in 
Example V for the expression of the P32 antigen of the invention in E. coli. 

Figure 11b corresponds to the pmTNF-MPH nucleic acid sequence. 

On this figure, the origin of nucleotide stretches used to construct plasmid pmTNF-MPH is specified 
hereafter. 

Position 

1-208    : lambda PL containing EcoRI blunt-MboII blunt fragment of pPL(λ) (Pharmacia) 
209-436  : synthetic DNA fragment 
230-232  : initiation codon (ATG) of mTNF fusion protein 
236-307  : sequence encoding AA 2 to 25 of mature mouse TNF 
308-384  : multiple cloning site containing His6 encoding sequence at position 315-332 
385-436  : HindIII fragment containing E. coli trp terminator 
437-943  : rrnBT1T2 containing HindIII-SspI fragment from pKK223 (Pharmacia) 
944-3474 : DraI-EcoRI blunt fragment of pAT153 (Bioexcellence) containing the 
           tetracycline resistance gene and the origin of replication. 
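
The coding stretches in the list above can be cross-checked by counting codons, assuming inclusive coordinates and three nucleotides per codon. The helper name below is hypothetical; the coordinates are those listed for pmTNF-MPH.

```python
def codon_count(start: int, end: int) -> int:
    """Number of whole codons in the inclusive nucleotide range start..end.

    Raises ValueError when the range length is not a multiple of three.
    """
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"range {start}-{end} is not a whole number of codons")
    return length // 3


initiation_codons = codon_count(230, 232)  # ATG of the mTNF fusion protein
mtnf_codons = codon_count(236, 307)        # amino acids 2 to 25 of mouse TNF
his_codons = codon_count(315, 332)         # histidine-encoding stretch
```

Under these assumptions the range 236-307 holds 24 codons, matching the 24 residues from amino acid 2 to amino acid 25, and the range 315-332 holds 6 codons, consistent with a stretch of six histidine codons.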



Table 6 hereafter corresponds to the complete restriction site analysis of pmTNF-MPH. 
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***************************** 

* RESTRICTION-SITE ANALYSIS * 
***************************** 

Done on DNA sequence pmTNF-MPH 

Total number of bases is: 3474. 
Analysis done on the complete sequence. 



List of cuts by enzyme. 
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Acc 


I : 


371 


2818 




















Acy 


I : 


788 


2264 


2921 


3035 


3056 














Afl 


II : 


387 






















Afl 


III : 


1698 












- 










Aha 


III : 


224 
















3441 


- 




Alu 


I : 


386 


439 


1141 


1398 


1534 


1760 


2382 


2785 


3456 




Alw 


NI 


1289 






















Apa 


I : 


Tic 






















Apa 


LI ; 


1384 










• 












Asp 


7181 : 


210 
















2333 


2582 


267 


Asu 


I : 


341 
2945 


342 
3297 


547 


676 


766 


1988 


2030 


2209 


Ava 


I : 


338 


2043 




















Ava 


II ! 


547 


1988 


2030 


2333 


2582 


2670 












Bal 


I : 


2026 






















Bam 


HI 


: 334 


3093 


« 


















Bbe 


I 


: 2267 


2924 


3038 


3059 
















Bbv 


I 


: 1369 


1788 


1806 


1919 


1922 


2866 


3255 










Bbv 


I* 


: 1070 


1276 


1279 


2026 


2050 


2683 












Bbv 


II 


I 1875 


2738 




















Bgl 


I 


£ 2306 


2540 




















Bin 


I 


: 17 


342 


956 


1054 


1140 


3101 












Bin 


I* 


: 329 


955 


1052 


2366 


3088 














Bsp 


HI 


: 908 


978 


2979 


















Bsp 


MI 


: 2414 






















Bsp 


Mil 


: 354 






















Bst 


NI 


: 215 


528 


638 


806 


1539 


1552 


1673 


2028 


2411 


3340 




Cau 


II 


: 6 


339 


340 


736 


769 


1321 


1986 


2212 


2936 


3300 




Cfr 


101 


: 374 


2185 


2539 


2699 


3058 


3067 


3308 










Cfr 


I 


: 2024 


2529 


2937 


3069 


3173 














Cla 


I 


: 3446 






















Cvi 


JI * 


: 192 


265 


272 


343 


350 


361 


386 


400 


439 


444 


47 






660 


678 


767 


828 


B44 


1141 


1170 


1213 


1224 


1289 


136 






1393 


1398 


1534 


1632 


1658 


1676 


1687 


1760 


1779 


1979 


198 






2026 


2063 


2145 


2189 


2210 


2215 


2353 


2363 


2382 


2423 


248 
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CvL QI 
Dde I 

Dpn I 



Dra II 
Dra III 
Dsa I 
Eco 311 
Eco 47III 
Eco 571 
Eco 571* 
Eco 781 
Eco NI 
Eco RI 
Eco RII 
Eco RV 
Fnu 4 HI 



Fnu DII 



Fok I 
Fok. I* 
Gsu I 
Gsu I* 
Bae I 
Hae II 
Hae III 



Bga I 
Hga 1* 
Bgi AI 
Hgi CI 
Bgi JII 
Bha I 



Bin PlI 



Hind II 
Hind III 
Hint I 



2488 

2947 
211 
135 

11 

2342 
1988 
295 
345 
615 
1826 
216 
1156 
2265 
198 
309 
213 
3285 
401 

1795 

2447 
542 

2493 
468 
816 
2088 
2642 
361 
594 
343 

2210 

3298 
160 

1008 
141 
210 
345 
542 

2115 

3239 
540 

2113 

3237 
109 
384 
368 



2518 2531 2552 2597 2641 2785 2801 2857 2875 

2985 2999 3071 3140 3175 3298 3322 3441 3456 
3306 

571 661 717 1015 1424 1888 



293 



238 336 



950 



962 1040 1048 1059 1134 2010 232 



2373 2645 3004 3095 3122 
2030 2945 
331 



2021 


2940 


















2695 


2976 


3238 
















2922 
2845 


3036 


3057 
















526 


636 


804 


1537 


1550 


1671 ' 


2026 


2409 


3338 




417 


532 


1084 


1290 


1293 


1358 


1501 


1656 


1774 


177 


1908 


1911 


2040 


2054 


2061 


2064 


2183 


2262 


2307 


236 


2532 
1074 


2697 
1655 


2748 
1837 


2855 
1934 


2889 
2056 


2892 
2082 


3170 
2227 


3173 
2237 


3244 
2366 


243 


2498 
852 
2423 


2525 
3370 
2468. 


2654 
3322 


2769 


3125 












828 
1458 
361 


844 
1828 
678 


1224 
220/ 
767 


1676 

^ COT 
ZDS / 

828 


1687 
844 


2026 

4 ? / O 

1224 


2423 

J U JO 

1656 


2480 

J W J ^ 

1676 


2552 
3240 
1687 


202 


2423 


2480 


2531 


2552 


2641 


2875 


2939 


2947 


3071 


317 


183 
1586 
1388 
2179 
2987 

593 


796 
2482 
2007 
2263 
3001 

1074 


2088 
2514 
2298 
2702 

1183 


2238 
3068 
2885 
2920 

1357 


2829 

3196 
3034 

1457 


3055 
1524 


3349 
1794 


3392 
1827 


2017 


205 


2266 


2525 


2656 


2696 


2771 


2923 


2977 


3037 


3058 


321 


3371 
591 


1072 


1181 


1355 


1455 


1522 


1792 


1825 


2015 


205 


2264 


2523 


2654 


2694 


2769 


2921 


2975 


3035 


3056 


320 


3369 
372 
437 

1328 


2819 
3439 
1724 


1799 


1944 


2165 


2463 


2617 


2837 
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Upa II 



Hph I 
Hph I* 
Kpa I 

Mae 1- 
Mae II 
Mae III 
Mbo I 



Mbo II 
Mbo II* 
Hme I* 
Mnl I 

Mnl I* 



Mse I 
Mst I 
Hae I 
Nar I 
Nco I 
Nhe I 
Nla III 



Nla IV 



Nru I 
Nsp BII 
Nsp BX 
Pfl MI 
Pie I 
Pie I* 
PBia CI 
Ppu MI 

PS3 I 

Rsa I 
Sal I 

Scr FI 



Sdu I 
Sec I 

Sfa NI 
Sfa NI* 
Sma I 
Sph I 
Sso II 



5 


339 


355 


^ ^ 
375 


T T C 






1320 


1346 


1493 


198 


2186 


2212 


2450 


2540 


2700 


2776 


2936 


3059 


3068 


3083 


330 


3309 
96 


140 


183 


716 


967 


1953 


2174 


3028 


3073 


3355 




8 


305 


311 


317 
















214 

365 
276 
171 
9 


952 
330 
257 
236 


1205 
751 

1162 
334 


1981 
997 

1278 
948 


3240 
1900 
1341 
960 


1924 

2320 
1038 


2513 
2587 
1046 


2569 
3255 
1057 


3343 
1132 


2008 


232 


2340 
209 


2371 
475 


2643 
970 


3002 
1832 


3093 
1880 


3120 
2472 


2743 










1041 
1305 
372 


2997 
1489 
1271 


3165 
1595 


3252 
2001 


2499 


2683 













210 

2864 
181 
2016 
2187 
2264 
345 
3239 
168 

2222 

212 

2265 

2498 



291 350 764 1520 

3083 3287 3347 

188 223 388 486 

2114 3210 

2541 2701 3069 

2921 3035 3056 



1803 2169 2196 2234 2295 
817 994 3414 3436 



259 



232 


349 


382 


565 


620 


912 


982 


1702 


1881 


201 


2279 


2294. 


2422 


2539 


2725 


2764 


2910 


2983 


3121 


346 


336 


343 


549 


1631 


1670 


1989 


2032 


2146 


2181 


221 


2583 


2704 


2922 


2946 


3036 


3057 


3095 


3141 


3351 


339 



412 


1115 


1360 


2331 












382 


1702 


2910 














295 


2105 


2154 














376 


1807 
















1322 


2831 
















331 


















1986 


2030 
















1991 


2033 


2948 














212 


3307 
















370 


2817 












769 


806 


6 


215 


339 


340 


528 


638 


736 


1552 


1673 


1986 


2028 


2212 


2411 


2936 


3300 


3340 


141 


345 


1388 


2007 


2298 


2885 


2987 


3001 


3196 


5 


338 


345 


1538 


2021 


2099 


2301 


2934 


2940 


650 


818 


2445 


2820 


3231 


3344 








420 


1601 


2038 


2433 


3054 


3066 


3255 






340 


















382 


2910 














804 


4 


213 


337 


338 


526 


636 


734 


767 



1321 153 



335 



153 
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1550 


1671 


1984 


2026 


2210 


2409 


2934 


3298 


3338 


Stu I J 


361 


















Sty I : 


345 


2099 
















Tag I : 


254 


371 


666 


1600 


2202 


2343 


2818 


3131 


3446 


Taq IIB 


i 1802 


















Taq IIB* ' 


: 2804 


















Tthlllll 


40 


1107 
















TthlMII* ' 


: 686 


1075 


1114 














Xba I 


\ 364 


















Xho II 


: 9 


334 


948 


960 


1046 


1057 


3093 






Xma I 


I 338 


















Xma III 


r 2529 


















Xmn I 


: \ 


467 
















Total number of cuts is: 743. 
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25 List of non cutting selected enzymes. 

Aat II, Asu II, Avr II, Bbv II*, Bcl I, Bgl II, Bsp MI*,
Bss HII, Bst BII, Bst XI, Eco 31I*, Esp I, Hpa I, Mlu I,
Kme I, Nde I, Not I, Nsi I, Pst I, Pvu I, Pvu II,
Rsr II, Sac I, Sac II, Sau I, Sca I, Sci I, Sfi I,
Sna BI, Spe I, Spl I*, Ssp I, Taq IIA, Taq IIA*, Tth 111I,
Vsp I, Xca I, Xho I

Total number of selected enzymes which do not cut: 38
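
A restriction-site analysis of this kind can be sketched as a plain substring search over a circular sequence. The sketch below is illustrative only: the function name `find_sites` and the toy sequence are hypothetical, and it reports the 1-based start of each recognition site, which is an assumption and may differ from the cut-position convention used by the program that produced the tables.

```python
def find_sites(sequence: str, site: str, circular: bool = True) -> list[int]:
    """Return the 1-based start position of every occurrence of `site`."""
    seq = sequence.upper()
    site = site.upper()
    # On a circular plasmid a site may span the origin, so the first
    # len(site) - 1 bases are appended before searching.
    search = seq + seq[: len(site) - 1] if circular else seq
    positions = []
    start = search.find(site)
    while start != -1:
        positions.append(start + 1)
        start = search.find(site, start + 1)
    return positions


# Toy 15-base sequence (hypothetical, not taken from any figure)
toy = "AATTCGGATCCTTTG"
bamhi_sites = find_sites(toy, "GGATCC")   # BamHI recognition sequence
ecori_sites = find_sites(toy, "GAATTC")   # EcoRI site spanning the origin
```

An enzyme whose recognition sequence returns an empty list would appear in the list of non-cutting enzymes.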



Figure 12a corresponds to the restriction and genetic map of the plasmid pIG2 used to make the
intermediary construct pIG2 Mt32 as described in Example IV for the subcloning of the P32 antigen in
plasmid pIGRI.

Figure 12b corresponds to the pIG2 nucleic acid sequence.

On this figure, the origin of nucleotide stretches used to construct plasmid pIG2 is specified hereafter.

Position

3300-206 : lambda PL containing EcoRI-MboII blunt
           fragment of pPL(λ) (Pharmacia)
207-266  : synthetic sequence containing multiple
           cloning site and ribosome binding site
           of which the ATG initiation codon is
           located at position 232-234
267-772  : rrnBT1T2 containing HindIII-SspI
           fragment from pKK223 (Pharmacia)
773-3300 : tetracycline resistance gene and origin
           of replication containing EcoRI-DraI
           fragment of pAT153 (Bioexcellence)
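
As a reading aid, the position table can be expressed as a lookup on a circular map. This is a minimal sketch: the names `SEGMENTS` and `segments_at` are hypothetical, the descriptions are shortened, and the total length of 3301 bases is taken from the restriction-site analysis of pIG2. Position 3300 is listed as a boundary of two stretches, so the lookup returns both.

```python
PIG2_LENGTH = 3301  # total number of bases reported for pIG2

# (start, end, shortened description) as listed in the position table
SEGMENTS = [
    (3300, 206, "lambda PL promoter fragment"),
    (207, 266, "synthetic multiple cloning site and ribosome binding site"),
    (267, 772, "rrnB T1T2 terminator fragment"),
    (773, 3300, "tetracycline resistance gene and origin of replication"),
]


def in_segment(pos: int, start: int, end: int) -> bool:
    """True if pos lies in [start, end], allowing wrap-around past the origin."""
    if start <= end:
        return start <= pos <= end
    return pos >= start or pos <= end


def segments_at(pos: int) -> list[str]:
    """Return the description of every stretch that covers position pos."""
    if not 1 <= pos <= PIG2_LENGTH:
        raise ValueError("position outside the plasmid")
    return [desc for start, end, desc in SEGMENTS if in_segment(pos, start, end)]


atg_segment = segments_at(232)  # first base of the ATG initiation codon
```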

Table 7 corresponds to the complete restriction site analysis of pIG2.

* RESTRICTION-SITE ANALYSIS *

Done on DNA sequence pIG2

Total number of bases is: 3301.
Analysis done on the complete sequence.

List of cuts by enzyme.
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Acc 

Acy 

Afl 

Aha 

Alu 

Alw 

Apa 

Asp 

Asu 

Ava 

Ava 

Bal 

BaxB 

Bbe 

Bbv 

Bbv 

Bbv 

Bgl 

Bin 

Bin 

Bsp 

Bsp 

Bst 

Cau 

Cfr 

Cfr 

Cla 

Cvi 
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LI 

7181 
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I 

BI 
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Eco 311 
Eco 47III 
Eco 571 
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1516 
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230 
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2093 2750 2864 2885 
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2243 




















357 


467 


635 


1368 


1381 
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1227 
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1813 
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2426 
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2814 


2828 
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3270 


3285 
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490 
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2924 
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Figure 13 corresponds to the amino acid sequence of the total fusion protein naTNF-HISG-Pas. 
On this figure :
- the continuous underlined sequence represents the mTNF sequence (first 25 amino acids),
- the dotted underlined sequence represents the polylinker sequence,
- the double underlined sequence represents the extra amino acids created at cloning site, and
- the amino acid marked with nothing is the antigen sequence starting from the amino acid at position 4 of
figure 5.

Figures 14a and 14b correspond to the expression of the mTNF-His6-P32 fusion protein in K12ΔH given
in Example VI, with Fig. 14a representing the Coomassie Brilliant Blue stained SDS-PAGE and Fig. 14b
representing immunoblots of the gel with anti-32-kDa and anti-mTNF antibody.

On fig. 14a, the lanes correspond to the following:

Lanes
1. protein molecular weight markers
2. pmTNF-MPH-Mt32 28°C 1 h induction
3. pmTNF-MPH-Mt32 42°C 1 h induction
4. pmTNF-MPH-Mt32 42°C 2 h induction
5. pmTNF-MPH-Mt32 42°C 3 h induction
6. pmTNF-MPH-Mt32 28°C 4 h induction
7. pmTNF-MPH-Mt32 42°C 4 h induction
8. pmTNF-MPH-Mt32 28°C 5 h induction
9. pmTNF-MPH-Mt32 42°C 5 h induction



On fig. 14b, the lanes correspond to the following:

Lanes
1. pmTNF-MPH-Mt32 28°C 1 h induction
2. pmTNF-MPH-Mt32 42°C 1 h induction
3. pmTNF-MPH-Mt32 28°C 4 h induction
4. pmTNF-MPH-Mt32 42°C 4 h induction



Figure 15 corresponds to the IMAC elution profile of the recombinant antigen with decreasing pH as 
presented in Example VII. 

Figure 16 corresponds to the IMAC elution profile of the recombinant antigen with increasing imidazole 
concentrations as presented in Example VII. 

Figure 17 corresponds to the IMAC elution profile of the recombinant antigen with a step gradient of 
increasing imidazole concentrations as presented in Example VII. 



EXAMPLE I: 



MATERIAL AND METHODS 



Screening of the λgt11 M. tuberculosis recombinant DNA library with anti-32-kDa antiserum

A λgt11 recombinant library constructed from genomic DNA of M. tuberculosis (Erdman strain) was
obtained from R. Young (35). Screening was performed as described (14, 35) with some modifications
hereafter mentioned. λgt11-infected E. coli Y1090 (10^ pfu per 150 mm plate) were seeded on NZYM plates
(Gibco) (16) and incubated at 42°C for 24 hrs. To induce expression of the β-galactosidase-fusion proteins,
the plates were overlaid with isopropyl β-D-thiogalactoside (IPTG)-saturated filters (Hybond C extra,
Amersham) and incubated for 2 hrs at 37°C. Screening was done with a polyclonal rabbit anti-32-kDa
antiserum. Said polyclonal rabbit anti-32-kDa antiserum was obtained by raising antiserum
against the P32 of M. bovis BCG (strain 1173P2 - Institut Pasteur Paris) as follows: 400 µg (purified P32
protein of M. bovis BCG) per ml physiological saline were mixed with one volume of incomplete Freund's
adjuvant. The material was homogenized and injected intradermally in 50 µl doses, delivered at 10 sites in
the back of the rabbits, at 0, 4, 7 and 8 weeks (adjuvant was replaced by the diluent for the last injection).

One week later, the rabbits were bled and the sera tested for antibody level before being distributed in
aliquots and stored at -80°C.

The polyclonal rabbit anti-32-kDa antiserum was pre-absorbed on E. coli lysate (14) and used at a final
dilution of 1:300. A secondary alkaline-phosphatase anti-rabbit IgG conjugate (Promega), diluted at 1:5000,
was used to detect the β-galactosidase fusion proteins. For color development nitro blue tetrazolium (NBT)
and 5-bromo-4-chloro-3-indolyl phosphate (BCIP) were used. Reactive areas on the filter turned deep purple
within 30 min. Usually three consecutive purification steps were performed to obtain pure clones. IPTG,
BCIP and NBT were from Promega Corp. (Madison, WI).

Plaque screening by hybridization for obtaining the secondary clones By1, By2 and By5 hereafter defined



The procedure used was as described by Maniatis et al. (14). 



Preparation of crude lysates from λgt11 recombinant lysogens



Colonies of E. coli Y1089 were lysogenized with appropriate λgt11 recombinants as described by
Huynh et al. (14). Single colonies of lysogenized E. coli Y1089 were inoculated into LB medium and grown
to an optical density of 0.5 at 600 nm at 30°C. After a heat shock at 45°C for 20 min, the production of
β-galactosidase-fusion protein was induced by the addition of IPTG to a final concentration of 10 mM.
Incubation was continued for 60 min at 37°C and cells were quickly harvested by centrifugation. Cells were
concentrated 50 times in buffer (10 mM Tris pH 8.0, 2 mM EDTA) and rapidly frozen in liquid nitrogen.
The samples were lysed by thawing and treated with 100 µg/ml DNase I in EcoRI restriction buffer for 5-10
minutes at 37°C.



Immunoblotting (Western blotting) analysis: 

After SDS-PAGE electrophoresis, recombinant lysogen proteins were blotted onto nitrocellulose membranes
(Hybond C, Amersham) as described by Towbin et al. (29). The expression of mycobacterial
antigens fused to β-galactosidase in E. coli Y1089 was visualized by the binding of a polyclonal rabbit
anti-32-kDa antiserum (1:1000), obtained as described in the above paragraph "Screening of the λgt11 M.
tuberculosis recombinant DNA library with anti-32-kDa antiserum", and using a monoclonal
anti-β-galactosidase antibody (Promega). A secondary alkaline-phosphatase anti-rabbit IgG conjugate
(Promega), diluted at 1:5000, was used to detect the fusion proteins.

The use of these various antibodies makes it possible to detect the β-galactosidase fusion protein. This
reaction is due to the M. tuberculosis protein, because non-fused β-galactosidase is also present on
the same gel and is not recognized by the serum from tuberculous patients.

In order to identify selective recognition of recombinant fusion proteins by human tuberculous sera,
nitrocellulose sheets were incubated overnight with these sera (1:50) (after blocking nonspecific protein
binding sites). The human tuberculous sera were selected for their reactivity (high or low) against the
purified 32-kDa antigen of M. bovis BCG tested in a Dot blot assay as previously described (31). Reactive
areas on the nitrocellulose sheets were revealed by incubation with peroxidase conjugated goat anti-human
IgG antibody (Dakopatts, Copenhagen, Denmark) (1:200) for 4 hrs and, after repeated washings, the color
reaction was developed by adding peroxidase substrate (α-chloronaphthol) (Bio-Rad) in the presence of
peroxidase and hydrogen peroxide.

Recombinant DNA analysis 

Initial identification of M. tuberculosis DNA inserts in purified λgt11 clones was performed by EcoRI
restriction. After digestion, the excised inserts were run on agarose gels and submitted to Southern
hybridization. Probes were labeled with α-32P-dCTP by random priming (10). Other restriction sites were
located by single and double digestions of recombinant λgt11 phage DNA or their subcloned
fragments by HindIII, PstI, KpnI, AccI and SphI.



Sequencing 

Sequence analysis was done by the primer extension dideoxy termination method of Sanger et al. (25)
after subcloning of specific fragments in Bluescribe-M13 (6) or in mp10 and mp11 M13 vectors (Methods in
Enzymology, vol. 101, 1983, p. 20-89, Joachim Messing, New M13 vectors for cloning, Academic Press).
Sequence analysis was greatly hampered by the high GC content of the M. tuberculosis DNA (65%).
Sequencing reactions were therefore performed with several DNA polymerases: T7 DNA polymerase
("Sequenase" USB), Klenow fragment of DNA polymerase I (Amersham) and in some cases with AMV
reverse transcriptase (Super RT, Anglian Biotechnology Ltd.) and sometimes with dITP instead of dGTP.
Several oligodeoxynucleotides were synthesized and used to focus on ambiguous regions of the sequence.
The sequencing strategy is summarized in Fig. 2. In order to trace possible artefactual frameshifts in some
ambiguous regions, a special program was used to define the most probable open reading frame in
sequences containing a high proportion of GC (3). Several regions particularly prone to sequencing artefacts
were confirmed or corrected by chemical sequencing (18). For this purpose, fragments were subcloned in
the chemical sequencing vector pGV462 (21) and analysed as described previously. Selected restriction
fragments of about 250-350 bp were isolated, made blunt-ended by treatment with either Klenow polymerase
or Mung bean nuclease, and subcloned in the SmaI or HincII site of pGV462. Both strands of the inserted
DNA were sequenced by single-end labeling at Tth111I or BstEII (32) and a modified chemical degradation
strategy (33).

Routine computer aided analysis of the nucleic acid and deduced amino acid sequences was
performed with the LGBC program from Bellon (2). Homology searches used the FASTA programs from
Pearson and Lipman (23) and the Protein Identification Resource (PIR) from the National Biomedical
Research Foundation, Washington (NBRF) (NBRF/PIR data bank), release 16 (March 1988).



RESULTS 



- Screening of the λgt11 M. tuberculosis recombinant DNA library with polyclonal anti-32-kDa antiserum :

Ten filters representing 1.5x10^ plaques were probed with a polyclonal rabbit anti-32-kDa antiserum (8).
Following purification, six independent positive clones were obtained.

Analysis of recombinant clones 



EcoRI restriction analysis of the DNA of these 6 purified λgt11 recombinant clones (Fig. 1A) revealed 4
different types of insert. Clone 15 had an insert with a total length of 3.8 kb with two additional internal
EcoRI sites resulting in three DNA fragments of 1.8 kb, 1.5 kb and 0.5 kb. The DNA insert of clone 16 was
1.7 kb long. Clones 17 and 19 had a DNA insert of almost identical length, being 2.7 kb and 2.8 kb
respectively.

Finally, clone 23 (not shown) and clone 24 both contained an insert of 4 kb with one additional EcoRI
restriction site giving two fragments of 2.3 kb and 1.7 kb. Southern analysis (data not shown) showed that
the DNA inserts of clones 15, 16, 19 and the small fragment (1.7 kb) of clone 24 only hybridized with
themselves, whereas clone 17 (2.7 kb) hybridized with itself but also equally well with the 2.3 kb DNA
fragment of clone 24. Clones 15, 16 and 19 are thus distinct and unrelated to the 17, 23, 24 group. This
interpretation was further confirmed by analysis of crude lysates of E. coli Y1089 lysogenized with the
appropriate λgt11 recombinants and induced with IPTG. Western blot analysis (Fig. 1B) showed no fusion
protein, either mature or incomplete, reactive with the polyclonal anti-32-kDa antiserum in cells expressing
clones 15, 16 and 19. Clones 15, 16 and 19 were thus considered as false positives and were not further
studied. On the contrary, cells lysogenized with clones 23 and 24 produced an immunoreactive fusion
protein containing about 10 kDa of the 32-kDa protein. Clone 17 finally expressed a fusion protein of which
the foreign polypeptide part is about 25 kDa long. The restriction endonuclease maps of the 2.3 kb insert of
clone 24 and of the 2.7 kb fragment of clone 17 (Fig. 2) allowed us to align and orient the two inserts,
suggesting that the latter corresponds to a ± 0.5 kb 5' extension of the first.

As clone 17 was incomplete, the same λgt11 recombinant M. tuberculosis DNA library was screened by
hybridization with a 120 bp EcoRI-KpnI restriction fragment corresponding to the very 5' end of the DNA
insert of clone 17 (previously subcloned in a Blue Scribe vector commercialized by Vector Cloning Systems
(Stratagene Cloning System)) (Fig. 2). Three 5'-extended clones By1, By2 and By5 were isolated, analyzed
by restriction and aligned. The largest insert, By5, contained the information for the entire coding region (see
below) flanked by 3.1 kb upstream and 1.1 kb downstream (Fig. 2).



DNA sequencing

The 1358 base pair nucleotide sequence derived from the various overlapping λgt11 clones is
represented in Fig. 3a and Fig. 3b. The DNA sequence contains a 1059 base pair open reading frame
starting at position 183 and ending with a TAG codon at position 1242. The NH2-terminal
amino acid sequence (phe-ser-arg-pro-gly-leu-pro-val-glu-tyr-leu-gln-val-pro-ser-pro-ser-met-gly-arg-asp-ile-
lys-val-gln-phe-gln-ser-gly-gly-ala-asn), which can be located within this open reading frame from the
nucleotide sequence beginning with a TTT codon at position 360, corresponds to the NH2-terminal
amino acid sequence of the MPB 59 antigen except for the amino acids at positions 20, 21 and 31, which are
respectively gly, cys and asn in MPB 59 (34). Therefore, the DNA region upstream of this sequence is
expected to encode a signal peptide required for the secretion of a protein of 32-kDa. The mature protein
thus presumably consists of 295 amino acid residues from the N-terminal Phe (TTT codon) to the C-terminal
Ala (GCC codon) (Fig. 5).

Six ATG codons were found to precede the TTT at position 360 in the same reading frame. Usage of
any of these ATGs in the same reading frame would lead to the synthesis of signal peptides of
29, 42, 47, 49, 55 and 59 residues.
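
The relation between signal peptide length and the position of the initiating ATG is simple codon arithmetic, sketched below. The ATG positions it produces are derived here and are not stated in the text; the sketch assumes that the signal peptide length counts the initiating methionine and that each position refers to the first base of a codon in the Fig. 3 numbering. The function name `atg_position` is hypothetical.

```python
MATURE_START = 360  # first base of the TTT codon (Fig. 3 numbering)
ORF_START = 183     # reported start of the open reading frame
SIGNAL_LENGTHS = [29, 42, 47, 49, 55, 59]  # residues, as listed in the text


def atg_position(mature_start: int, signal_length: int) -> int:
    """First base of the in-frame ATG lying signal_length codons upstream."""
    return mature_start - 3 * signal_length


# Derived positions (an inference from the stated lengths, not source data)
positions = {n: atg_position(MATURE_START, n) for n in SIGNAL_LENGTHS}
```

Under these assumptions the longest signal peptide (59 residues) starts exactly at position 183, the reported start of the open reading frame.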



Hydropathy pattern 

The hydropathy patterns of the coding sequence of the protein of 32-kDa of the invention and of the
antigen α of BCG (17) were determined by the method of Kyte and Doolittle (15). The nonapeptide profiles
are shown in Fig. 6. Besides the initial hydrophobic signal peptide region, several hydrophilic domains could
be identified. It is interesting to note that the overall hydrophilicity pattern of the protein of 32-kDa of the
invention is comparable to that of the antigen α. For both proteins, a domain of highest hydrophilicity could
be identified between amino acid residues 200 and 250.
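
A nonapeptide Kyte-Doolittle profile is a sliding-window average of per-residue hydropathy values. The sketch below uses the standard Kyte-Doolittle scale with a window of 9 residues; the function name `hydropathy_profile` is hypothetical, and the example input is only the 32-residue NH2-terminal sequence quoted in the text, not the full protein plotted in Fig. 6.

```python
# Kyte-Doolittle hydropathy index per one-letter amino acid code
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(protein: str, window: int = 9) -> list[float]:
    """Mean hydropathy of every window of `window` consecutive residues."""
    scores = [KYTE_DOOLITTLE[aa] for aa in protein.upper()]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]


# NH2-terminal sequence of the mature protein, in one-letter code
N_TERMINAL = "FSRPGLPVEYLQVPSPSMGRDIKVQFQSGGAN"
profile = hydropathy_profile(N_TERMINAL)
```

Positive window averages indicate hydrophobic stretches and negative averages hydrophilic ones.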



Homology 



Matsuo et al. (17) recently published the sequence of a 1095 nucleotide cloned DNA corresponding to
the gene coding for the antigen α of BCG. The 978 bp coding region of M. bovis antigen α, as revised in
Infection and Immunity, vol. 58, p. 550-556, 1990, and the 1017 bp coding region of the protein of 32-kDa of
the invention show 77.5% homology in an aligned region of 942 bp. At the amino acid level both
precursor protein sequences share 75.6% identical residues. In addition, 17.6% of the amino acids
correspond to evolutionary conserved replacements as defined in the algorithm used for the comparison
(PAM250 matrix, ref 23). Figure 7 shows sequence divergences in the N-terminal of the signal peptide. The
amino terminal sequence - 32 amino acids - of both mature proteins is identical except for position 31.

Human sera recognize the recombinant 32-kDa protein

Fig. 8 shows that serum samples from tuberculous patients, when immunoblotted with a crude E. coli
extract expressing clone 17, distinctly react with the 140 kDa fusion protein (lanes 4 to 6) containing the
protein of 32-kDa of the invention, but not with unfused β-galactosidase expressed in a parallel extract
(lanes 10 to 12). Serum samples from two negative controls, selected as responding very little to the purified
protein of 32-kDa of the invention, recognize neither the 140 kDa fused protein containing the protein of
32-kDa of the invention nor the unfused β-galactosidase (lanes 2, 3 and 8 and 9). The 140 kDa fused protein
and the unfused β-galactosidase were easily localized by reaction with the anti-β-galactosidase monoclonal
antibody (lanes 1 to 7).

The invention has made it possible to prepare a DNA region coding particularly for a protein of 32-kDa
(cf. fig. 5), said DNA region containing an open reading frame of 338 codons (stop codon not included). At
position 220 a TTT encoding the first amino acid of the mature protein is followed by the 295 triplets coding
for the mature protein of 32-kDa. The size of this open reading frame, the immunoreactivity of the derived fusion
proteins, the presence of a signal peptide and, especially, the identification within this gene of a NH2-terminal
region highly homologous to that found in the MPB 59 antigen (31/32 amino acids homology) and
in the BCG antigen α (31/32 amino acids homology) (see Fig. 7), strongly suggest that the DNA fragment
described contains the complete cistron encoding the protein of 32-kDa secreted by M. tuberculosis, which
had never so far been identified in a non-ambiguous way.

Six ATG codons were found to precede this TTT at position 220 in the same reading frame. Usage of
any of these ATGs in the same reading frame would lead to the synthesis of signal peptides of 43, 48, 50,
56 or 60 residues. Among these various possibilities, initiation is more likely to take place either at ATG91
or ATG52 because both are preceded by a plausible E. coli-like promoter and a Shine-Dalgarno motif.

If initiation takes place at ATG91, the corresponding sequence would code for a rather long signal
peptide of 43 residues. This length however is not uncommon among secreted proteins from Gram positive
bacteria (5). It would be preceded by a typical E. coli Shine-Dalgarno motif (4/6 residues homologous to
AGGAGG) at a suitable distance.

If initiation takes place at ATG52, the corresponding sequence would code for a signal peptide of 56
residues but would have a less stringent Shine-Dalgarno ribosome binding site sequence.

The region encompassing the translation termination triplet was particularly sensitive to secondary
structure effects which lead to so-called compressions on the sequencing gels. In front of the TAG
termination codon at position 1105, 22 out of 23 residues are G-C base pairs, of which 9 are G's.
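
The positions quoted in the Fig. 5 numbering can be cross-checked with the same codon arithmetic. This is a sketch under the assumption that all positions refer to the first base of a codon and that a signal peptide length includes the initiating methionine; the variable names are hypothetical.

```python
MATURE_START = 220     # TTT codon of the mature protein (Fig. 5 numbering)
MATURE_CODONS = 295    # triplets coding for the mature protein
STOP_POSITION = 1105   # reported position of the TAG termination codon
ORF_CODONS = 338       # reported open reading frame length, stop not included

# Signal peptide lengths listed in the text (residues)
SIGNAL_LENGTHS = [43, 48, 50, 56, 60]

# In-frame ATG position implied by each signal peptide length
atg_positions = {n: MATURE_START - 3 * n for n in SIGNAL_LENGTHS}

# The stop codon should directly follow the last codon of the mature protein
computed_stop = MATURE_START + 3 * MATURE_CODONS

# Number of codons between the ATG of the 43-residue option and the stop codon
codons_from_atg91 = (STOP_POSITION - atg_positions[43]) // 3
```

With these values the 43-residue and 56-residue options correspond to positions 91 and 52, the computed stop position agrees with the reported 1105, and the frame opening at position 91 spans 338 codons.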

Upstream of ATG130, a sequence resembling an E. coli promoter (11), comprising a hexanucleotide
(TTGAGA) (homology 5/6 to TTGACA) and an AAGAAT box (homology 4/6 to TATAAT) separated by 16
nucleotides, was observed. Upstream of the potential initiating codon ATG91, one could detect several
sequences homologous to the E. coli "-35 hexanucleotide box", followed by a sequence resembling a
TATAAT box. Among these, the most suggestive is illustrated in Fig. 3a and 3b. It comprises a TTGGCC at
position 59 (fig. 3a and 3b) (homology 4/6 to TTGACA) separated by 14 nucleotides from a GATAAG
(homology 4/6 to TATAAT). Interestingly, this putative promoter region shares no extensive sequence
homology with the promoter region described for the BCG protein α gene (17) nor with that described for
the 65 kDa protein gene (26, 28).
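
The "homology n/6" figures quoted for the promoter elements are counts of identical bases at corresponding positions of the candidate hexanucleotide and the E. coli consensus. A minimal sketch, with a hypothetical helper name `matches`:

```python
def matches(candidate: str, consensus: str) -> int:
    """Number of positions at which two equal-length sequences are identical."""
    if len(candidate) != len(consensus):
        raise ValueError("sequences must have the same length")
    return sum(a == b for a, b in zip(candidate.upper(), consensus.upper()))


MINUS_35 = "TTGACA"  # E. coli -35 consensus quoted in the text
MINUS_10 = "TATAAT"  # E. coli -10 consensus quoted in the text

# (candidate, consensus) pairs described for the two putative promoters
pairs = [
    ("TTGAGA", MINUS_35),
    ("AAGAAT", MINUS_10),
    ("TTGGCC", MINUS_35),
    ("GATAAG", MINUS_10),
]
scores = [matches(candidate, consensus) for candidate, consensus in pairs]
```

The resulting counts reproduce the 5/6, 4/6, 4/6 and 4/6 homologies given above.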

Searching the NBRF data bank (issue 16.0), no significant homology between the protein of 32-kDa of
the invention and any other completely known protein sequence could be detected. In particular, no
significant homology was observed between the 32-kDa protein and the α and β subunits of the human
fibronectin receptor (1). The NH2-terminal sequence of the 32-kDa protein of the invention is highly
homologous - 29/32 amino acids - to that previously published for BCG MPB 59 antigen (34) and to that of
BCG α-antigen - 31/32 amino acids - (Matsuo, 17) and is identical in its first 6 amino acids with the 32-kDa
protein of M. bovis BCG (9). However, the presumed initiating methionine precedes an additional 29 or 42
amino acid hydrophobic sequence which differs from the one of α-antigen (cf. Fig. 7), but displays all the
characteristics attributed to signal sequences of secreted polypeptides in prokaryotes (22).

Interestingly, no significant homology between the nucleic acid (1-1358) of the invention (cf. fig. 3a and
3b) and the DNA of the antigen α of Matsuo exists within their putative promoter regions.

EXAMPLE II : CONSTRUCTION OF A BACTERIAL PLASMID CONTAINING THE ENTIRE CODING
SEQUENCE OF THE 32-kDa PROTEIN OF M. TUBERCULOSIS

In the previous example, in figure 2, the various overlapping λgt11 isolates covering the 32-kDa protein
gene region from M. tuberculosis were described. Several DNA fragments were subcloned from these λgt11
phages in the Blue Scribe M13+ plasmid (Stratagene). Since none of these plasmids contained the entire
coding sequence of the 32-kDa protein gene, a plasmid containing this sequence was reconstructed.



Step 1 : Preparation of the DNA fragments :

1) The plasmid BS-By5-800, obtained by subcloning HindIII fragments of By5 (cf. fig. 2) into the Blue
Scribe M13+ plasmid (Stratagene), was digested with HindIII and a fragment of 800 bp was obtained and
isolated from a 1% agarose gel by electroelution.

2) The plasmid BS-4.1, obtained by subcloning the 2.7 kb EcoRI insert (from λgt11-17) into the Blue
Scribe M13+ plasmid (Stratagene) (see fig. 2 of patent application), was digested with HindIII and SphI
and a fragment of 1500 bp was obtained and isolated from a 1% agarose gel by electroelution.

3) Blue Scribe M13+ was digested with HindIII and SphI, and treated with calf intestine alkaline
phosphatase (special quality for molecular biology, Boehringer Mannheim) as indicated by the manufacturer.

Step 2 : Ligation :

The ligation reaction contained :
125 ng of the 800 bp HindIII fragment (1)
125 ng of the 1500 bp SphI-HindIII insert (2)
50 ng of the HindIII-SphI digested BSM13+ vector (3)
2 µl of 10x ligation buffer (Maniatis et al., 1982)
1 µl (= 2.5 U) of T4 DNA ligase (Amersham)
4 µl PEG 6000, 25% (w/v)
8 µl H2O

The incubation was for 4 hours at 16°C.
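
Because the two fragments are added in equal mass but differ in length, their molar amounts differ. The sketch below converts the stated masses to picomoles using an average mass of 650 g/mol per base pair, which is a common approximation and an assumption here, not a value from the text. The vector is left out because its length is not given. The function name `picomoles` is hypothetical.

```python
AVG_BP_MASS = 650.0  # g/mol per base pair of double-stranded DNA (assumed)


def picomoles(mass_ng: float, length_bp: int) -> float:
    """Approximate picomoles of a double-stranded fragment of given mass."""
    return mass_ng * 1000.0 / (length_bp * AVG_BP_MASS)


hindiii_fragment = picomoles(125, 800)       # 800 bp HindIII fragment
sphi_hindiii_insert = picomoles(125, 1500)   # 1500 bp SphI-HindIII insert

# Molar ratio of the shorter fragment to the longer one
molar_ratio = hindiii_fragment / sphi_hindiii_insert
```

At equal mass the molar ratio depends only on the two lengths (1500/800), so the 800 bp fragment is present in roughly a 1.9-fold molar excess regardless of the mass constant chosen.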

Step 3 : Transformation :

100 µl of DH5α E. coli (Gibco BRL) were transformed with 10 µl of the ligation reaction (step 2) and
plated on IPTG, X-Gal, ampicillin plates, as indicated by the manufacturer. About 70 white colonies were
obtained.



Step 4 :

As the 800 bp fragment could have been inserted in both orientations, plasmid DNA from several
clones was analyzed by digestion with PstI in order to select one clone (different from clone 11),
characterized by the presence of 2 small fragments of 229 and 294 bp. This construction contains the
HindIII-HindIII-SphI complex in the correct orientation. The plasmid containing this new construction was
called : "BS.BKP32.complet".



EXAMPLE III : EXPRESSION OF A POLYPEPTIDE OF THE INVENTION IN E. COLI :

The DNA sequence coding for a polypeptide, or part of it, can be linked to a ribosome binding site
which is part of the expression vector, or can be fused to the information of another protein or peptide
already present on the expression vector.

In the former case the information is expressed as such and hence devoid of any foreign sequences
(except maybe for the aminoterminal methionine which is not always removed by E. coli).
In the latter case the expressed protein is a hybrid or a fusion protein.

The gene coding for the polypeptide and the expression vector are treated with the appropriate
restriction enzyme(s) or manipulated otherwise so as to create termini allowing ligation. The resulting
recombinant vector is used to transform a host. The transformants are analyzed for the presence and proper
orientation of the inserted gene. In addition, the cloning vector may be used to transform other strains of a
chosen host. Various methods and materials for preparing recombinant vectors, transforming them to host
cells and expressing polypeptides and proteins are described by Panayotatos, N., in "Plasmids, a practical
approach" (ed. K.G. Hardy, IRL Press) pp. 163-176, and by Old and Primrose, Principles of Gene
Manipulation (2nd Ed. 1981), and are well known by those skilled in the art.

Various cloning vectors may be utilized for expression. Although a plasmid is preferable, the vector may
be a bacteriophage or cosmid. The vector chosen should be compatible with the host cell chosen.

Moreover, the plasmid should have a phenotypic property that will enable the transformed host cells to
be readily identified and separated from those which are not transformed. Such selection genes can be a
gene providing resistance to an antibiotic like, for instance, tetracycline, carbenicillin, kanamycin,
chloramphenicol, streptomycin, etc.

In order to express the coding sequence of a gene in E. coli, the expression vector should also contain
the necessary signals for transcription and translation.

Hence it should contain a promoter, synthetic or derived from a natural source, which is functional in
E. coli. Preferably, although usually not absolutely necessary, the promoter should be controllable by the
manipulator. Examples of widely used controllable promoters for expression in E. coli are the lac, the trp,
the tac and the lambda PL and PR promoters.

Preferably, the expression vector should also contain a terminator of transcription functional in E. coli.
Examples of used terminators of transcription are the trp and the rrnB terminators.

Furthermore, the expression vector should contain a ribosome binding site, synthetic or from a natural 
source, allowing translation and hence expression of a downstream coding sequence. Moreover, when 
expression devoid of foreign sequences is desired, a unique restriction site, positioned in such a way that it 
allows ligation of the sequence directly to the initiation codon of the ribosome binding site, should be 
present. 

A suitable plasmid for performing this type of expression is pKK233-2 (Pharmacia). This plasmid 
contains the trc promoter, the lac Z ribosome binding site and the rrnB transcription terminator. 

Also suitable is plasmid pIGRI (Innogenetics, Ghent, Belgium). This plasmid contains the tetracycline
resistance gene and the origin of replication of pAT153 (available from Bioexcellence, Biores B.V., Woerden,
The Netherlands), the lambda PL promoter up to the Mbo II site in the 5' untranslated region of the lambda
N gene (originating from pPL(λ); Pharmacia).

Downstream from the PL promoter, a synthetic sequence was introduced which encodes a "two
cistron" translation cassette whereby the stop codon of the first cistron (being the first 25 amino acids of
TNF, except for Leu at position 1 which is converted to Val) is situated between the Shine-Dalgarno
sequence and the initiation codon of the second ribosome binding site. The restriction and genetic map of
pIGRI is represented in Fig. 10a.

Fig. 10b and Table 5 represent respectively the nucleic acid sequence and complete restriction site
analysis of pIGRI.

However, when expression as a hybrid protein is desired, then the expression vector should also 
contain the coding sequence of a peptide or polypeptide which is (preferably highly) expressed by this 
vector in the appropriate host. 

In this case the expression vector should contain a unique cleavage site for one or more restriction 
endonucleases downstream of the coding sequence. 

Plasmids pEX1, 2 and 3 (Boehringer, Mannheim) and pUEX1, 2 and 3 (Amersham) are useful for this
purpose.

They contain an ampicillin resistance gene and the origin of replication of pBR322 (Bolivar et al. (1977)
Gene 2, 95-113), the lac Z gene fused at its 5' end to the lambda PR promoter together with the coding
sequence for the 9 first amino acids of its natural gene cro, and a multiple cloning site at the 3' end of the
lac Z coding sequence allowing production of a beta galactosidase fused polypeptide.

The pUEX vectors also contain the cI857 allele of the bacteriophage lambda cI repressor gene.

Also useful is plasmid pmTNF MPH (Innogenetics). It contains the tetracycline resistance gene and the
origin of replication of pAT153 (obtainable from Bioexcellence, Biores B.V., Woerden, The Netherlands), the
lambda PL promoter up to the Mbo II site in the N gene 5' untranslated region (originating from pPL(λ);
Pharmacia), followed by a synthetic ribosome binding site (see sequence data), and the information
encoding the first 25 AA of mTNF (except for the initial Leu which is converted to Val). This sequence is, in
turn, followed by a synthetic polylinker sequence which encodes six consecutive histidines followed by
several proteolytic sites (a formic acid, CNBr, kallikrein, and E. coli protease VII sensitive site, respectively),
each accessible via a different restriction enzyme which is unique for the plasmid (Sma I, Nco I, Bsp MII
and Stu I, respectively; see restriction and genetic map, Fig. 11a). Downstream from the polylinker, several
transcription terminators are present including the E. coli trp terminator (synthetic) and the rrnBT1T2
(originating from pKK223-3; Pharmacia). The total nucleic acid sequence of this plasmid is represented in
Fig. 11b.

Table 6 gives a complete restriction site analysis of pmTNF MPH. 

The presence of 6 successive histidines allows purification of the fusion protein by Immobilized Metal 
Ion Affinity Chromatography (IMAC). 

After purification, the foreign part of the hybrid protein can be removed by a suitable protein cleavage
method and the cleaved product can then be separated from the uncleaved molecules using the same
IMAC based purification procedure.

In all the above-mentioned plasmids where the lambda PL or PR promoter is used, the promoter is 
temperature-controlled by means of the expression of the lambda cI ts 857 allele which is either present on 
a defective prophage incorporated in the chromosome of the host (K12ΔH, ATCC n° 33767) or on a second 
compatible plasmid (pACYC derivative). Only in the pUEX vectors is this cI allele present on the vector 
itself. 

It is to be understood that the plasmids presented above are exemplary and other plasmids or types of 
expression vectors may be employed without departing from the spirit or scope of the present invention. 

If a bacteriophage or phagemid is used instead of a plasmid, it should have substantially the same 
characteristics as those used to select a plasmid as described above. 


EXAMPLE IV : SUBCLONING OF THE P32 ANTIGEN IN PLASMID pIGRI : 

Fifteen ug of plasmid "BS-BK-P32 complet" (see Example 11) was digested with Ecl XI and Bst EII 
(Boehringer, Mannheim) according to the conditions recommended by the supplier except that at least 3 
units of enzyme were used per ug of DNA. Ecl XI cuts at position 226 (Fig. 5) and Bst EII at position 1136, 
thus approaching very closely the start and stop codons of the mature P32 antigen. This DNA is hereafter 
called DNA coding for the "P32 antigen fragment". 
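
As a rough cross-check of the digest described above, the expected fragment size and the minimum amount of enzyme can be derived from the figures given in the text. The following sketch is illustrative only: the function names are hypothetical, and the fragment length is taken as the simple difference between the two stated cut positions, ignoring where exactly each enzyme cuts within its recognition site.

```python
def fragment_length(cut_5prime: int, cut_3prime: int) -> int:
    """Approximate fragment length (bp) between two cut positions."""
    if cut_3prime <= cut_5prime:
        raise ValueError("the 3' cut must lie downstream of the 5' cut")
    return cut_3prime - cut_5prime


def min_enzyme_units(dna_ug: float, units_per_ug: float = 3.0) -> float:
    """Minimum enzyme units for a digest at 'at least 3 units per ug of DNA'."""
    return dna_ug * units_per_ug


# Positions 226 and 1136 and the 15 ug of plasmid are taken from the text.
print(fragment_length(226, 1136))  # 910
print(min_enzyme_units(15))        # 45.0
```

Under these assumptions the "P32 antigen fragment" is about 910 bp long, and at least 45 units of each enzyme are needed for 15 ug of plasmid.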

The DNA coding for the "P32 antigen fragment" (as defined above) is subcloned in pIGRI (see fig. 10a) 
for expression of a polypeptide devoid of any foreign sequences. To bring the ATG codon of the expression 
vector in frame with the P32 reading frame, an intermediary construct is made in pIG2 (for restriction and 
genetic map, see fig. 12a; DNA sequences, see fig. 12b; complete restriction site analysis, see Table 7). 

Five ug of plasmid pIG2 is digested with Nco I. Its 5' sticky ends are filled in prior to 
dephosphorylation. 

To this end, the DNA was incubated in 40 ul NB buffer (0.05 M Tris-Cl pH 7.4; 10 mM MgCl2; 0.05% 
β-mercaptoethanol) containing 0.5 mM of all four dXTP (X = A,T,C,G) and 2 ul of Klenow fragment of E. coli 
DNA polymerase I (5 U/ul, Boehringer, Mannheim) for at least 3 h at 15°C. 

After blunting, the DNA was extracted once with one volume of phenol equilibrated against 200 mM 
Tris-Cl pH 8, twice with at least two volumes of diethylether and finally collected using the "Gene clean 
kit™" (Bio101) as recommended by the supplier. The DNA was then dephosphorylated at the 5' ends in 
30 ul of CIP buffer (50 mM Tris-Cl pH 8, 1 mM ZnCl2) and 20 to 25 units of calf intestine phosphatase (high 
concentration, Boehringer, Mannheim). The mixture was incubated at 37°C for 30 min, then EGTA 
(ethyleneglycol bis (β-aminoethylether)-N,N,N',N' tetraacetic acid) pH 8 was added to a final concentration of 
10 mM. The mixture was then extracted with phenol followed by diethylether as described above, and the 
DNA was precipitated by addition of 1/10 volume of 3 M KAc (Ac = CH3COO) pH 4.8 and 2 volumes of 
ethanol followed by storage at -20°C for at least one hour. 
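
The volume ratios used in the precipitation step, and the amount of EGTA stock needed to reach a given final concentration, can be worked out as in the sketch below. It is a minimal illustration, not part of the original protocol: the function names are hypothetical, the "2 volumes of ethanol" are assumed to refer to the starting sample volume, volumes are assumed additive, and the 100 mM EGTA stock is an assumed value, since the text does not state the stock concentration.

```python
def precipitation_volumes(sample_ul: float) -> tuple[float, float]:
    """Return (ul of 3 M KAc, ul of ethanol) for 1/10 volume salt and 2 volumes ethanol."""
    kac_ul = sample_ul / 10.0
    ethanol_ul = 2.0 * sample_ul  # assumption: relative to the starting sample volume
    return kac_ul, ethanol_ul


def stock_to_add(sample_ul: float, stock_mM: float, final_mM: float) -> float:
    """Volume of stock (ul) to add so that the mixture reaches final_mM.

    Solves stock_mM * v = final_mM * (sample_ul + v) for v.
    """
    if stock_mM <= final_mM:
        raise ValueError("stock must be more concentrated than the target")
    return final_mM * sample_ul / (stock_mM - final_mM)


# 30 ul reaction volume is from the text; the 100 mM EGTA stock is hypothetical.
print(precipitation_volumes(30.0))                 # (3.0, 60.0)
print(round(stock_to_add(30.0, 100.0, 10.0), 2))   # 3.33
```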

After centrifugation at 13000 rpm in a Biofuge A (Heraeus) for 5 min the pelleted DNA was dissolved in 
H2O to a final concentration of 0.2 ug/ul. 

The Ecl XI-Bst EII fragment, coding for the "P32 antigen fragment" (see above) was electrophoresed on 
a 1% agarose gel (BRL) to separate it from the rest of the plasmid and was isolated from the gel by 
centrifugation over a Millipore HVLP filter (Ø 2 cm) (2 min, 13000 rpm, Biofuge at room temperature) and 
extracted with Tris equilibrated phenol followed by diethylether as described above. 

The DNA was subsequently collected using the "Gene clean kit™" (Bio 101) as recommended by the 
supplier. 

After that, the 5' sticky ends were blunted by treatment with the Klenow fragment of E. coli DNA 
polymerase I as described above and the DNA was then again collected using the "Gene clean kit™" in 
order to dissolve it in 7 ul of H2O. 

One ul of vector DNA is added together with one ul of 10 x ligase buffer (0.5 M Tris-Cl pH 7.4, 100 mM 
MgCl2, 5 mM ATP, 50 mM DTT (dithiothreitol)) and 1 ul of T4 DNA ligase (1 unit/ul, Boehringer, 
Mannheim). Ligation was performed for 6 h at 13°C and 5 ul of the mixture is then used to transform strain 
DH1 (lambda) [strain DH1 - ATCC N° 33849 - lysogenized with wild type bacteriophage λ] using standard 
transformation techniques as described for instance by Maniatis et al. in "Molecular cloning, a laboratory 
manual", Cold Spring Harbor Laboratory (1982). 
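
The final composition of the ligation mixture follows from the volumes given above (7 ul insert, 1 ul vector, 1 ul 10 x buffer and 1 ul ligase, 10 ul in total). The sketch below simply applies the dilution factor to each buffer component; the dictionary and function names are hypothetical and serve only as an illustration.

```python
# Composition of the 10 x ligase buffer as stated in the text (mM).
LIGASE_BUFFER_10X_MM = {"Tris-Cl": 500.0, "MgCl2": 100.0, "ATP": 5.0, "DTT": 50.0}


def final_concentrations(stock_mM: dict, stock_ul: float, total_ul: float) -> dict:
    """Dilute each stock component into the total reaction volume."""
    return {name: conc * stock_ul / total_ul for name, conc in stock_mM.items()}


# 7 ul insert + 1 ul vector + 1 ul 10 x buffer + 1 ul T4 DNA ligase
total_ul = 7 + 1 + 1 + 1
print(final_concentrations(LIGASE_BUFFER_10X_MM, 1.0, total_ul))
# {'Tris-Cl': 50.0, 'MgCl2': 10.0, 'ATP': 0.5, 'DTT': 5.0}
```

The same 10 ul reaction contains 0.2 ug of vector DNA (1 ul at 0.2 ug/ul) and 1 unit of ligase.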

Individual transformants are grown and lysed for plasmid DNA preparation using standard procedures 
(Experiments with gene fusions, Cold Spring Harbor Laboratory (1984) (T.J. Silhavy, M.L. Berman and L.W. 
Enquist, eds)) and the DNA preparations are checked for the correct orientation of the gene within the 
plasmid by restriction enzyme analysis. 

A check for correct blunting is done by verifying the restoration of the Nco I site at the 5' and 3' ends of 
the antigen coding sequence. One of the clones containing the P32 antigen fragment in the correct 
orientation is kept for further work and designated pIG2-Mt32. In this intermediary construct, the DNA 
encoding the antigen is not in frame with the ATG codon. However, it can now be moved as a Nco I 
fragment to another expression vector. 
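
Whether a coding sequence is "in frame" with an upstream ATG reduces to a modulo-3 check on the distance between them. The sketch below illustrates this; the function names are hypothetical and the coordinates in the example are invented placeholders, not positions from pIG2-Mt32 or any figure of this document.

```python
def is_in_frame(atg_position: int, codon_start: int) -> bool:
    """True when codon_start lies a whole number of codons downstream of the ATG."""
    return (codon_start - atg_position) % 3 == 0


def bases_to_restore_frame(atg_position: int, codon_start: int) -> int:
    """Number of bases (0, 1 or 2) that must be added between the ATG and the
    insert so that the insert is read in the frame set by the ATG."""
    return (-(codon_start - atg_position)) % 3


# Hypothetical coordinates, for illustration only.
print(is_in_frame(1, 10))             # True  (9 bases = 3 codons)
print(is_in_frame(1, 11))             # False (10 bases)
print(bases_to_restore_frame(1, 11))  # 2
```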

15 ug of pIG2-Mt32 is digested with Nco I. The Nco I fragment encoding the P32 antigen is gel purified 
and blunted as described above. After purification using the "Gene clean kit™", it is dissolved in 7 ul of H2O. 

5 ug of plasmid pIGRI is digested with Nco I, blunted and dephosphorylated as described above. After 
phenol extraction, followed by diethylether extraction and ethanol precipitation, the pellet is dissolved in H2O to a final 
concentration of 0.2 ug/ul. 

Ligation of vector and "antigen fragment" DNA is carried out as described above. The ligation mixture 
is then transformed into strain DH1 (lambda) and individual transformants are analysed for the correct 
orientation of the gene within the plasmid by restriction enzyme analysis. A check for correct blunting is 
done by verifying the creation of a new Nsi I site at the 5' and 3' ends of the antigen coding sequence. One 
of the clones containing the P32 antigen fragment in the correct orientation is kept for further work and 
designated pIGRI-Mt32. 

EXAMPLE V: SUBCLONING OF THE P32 ANTIGEN IN pmTNF MPH: 

Fifteen ug of the plasmid pIG2-Mt32 (see example IV) was digested with the restriction enzyme Nco I 
(Boehringer, Mannheim), according to the conditions recommended by the supplier except that at least 3 
units of enzyme were used per ug of DNA. 

After digestion, the reaction mixture is extracted with phenol equilibrated against 200 mM Tris-Cl pH 8 
(one volume), twice with diethylether (2 volumes) and precipitated by addition of 1/10 volume of 3 M KAc 
(Ac = CH3COO) pH 4.8 and 2 volumes of ethanol followed by storage at -20°C for at least one hour. 

After centrifugation for 5 minutes at 13000 rpm in a Biofuge A (Heraeus) the DNA is electrophoresed on 
a 1% agarose gel (BRL). 

The DNA coding for the "P32 antigen fragment" as described above, is isolated by centrifugation over a 
Millipore HVLP filter (Ø 2 cm) (2 minutes, 13000 rpm, Biofuge at room temperature) and extracted once with 
Tris-Cl equilibrated phenol and twice with diethylether. The DNA is subsequently collected using the "Gene clean 
kit™" (Bio 101) and dissolved in 7 ul of H2O. 

The 5' overhanging ends of the DNA fragment generated by digestion with Nco I were filled in by 
incubating the DNA in 40 ul NB buffer (0.05 M Tris-HCl, pH 7.4; 10 mM MgCl2; 0.05% β-mercaptoethanol) 
containing 0.5 mM of all four dXTPs (X = A, T, C, G) and 2 ul of Klenow fragment of E. coli DNA 
polymerase I (5 units/ul, Boehringer Mannheim) for at least 3 h at 15°C. After blunting, the DNA was 
extracted with phenol, followed by diethylether, and collected using the "Gene clean kit™" as described 
above. 

Five ug of plasmid pmTNF MPH is digested with Stu I, subsequently extracted with phenol, followed by 
diethylether, and precipitated as described above. This restriction digest is verified by electrophoresis of a 
0.5 ug sample on an analytical 1.2% agarose gel. 

The plasmid DNA is then dephosphorylated at the 5' ends to prevent self-ligation in 30 ul of CIP buffer 
(50 mM Tris-Cl pH 8, 1 mM ZnCl2) and 20 to 25 units of calf intestine phosphatase (high concentration, 
Boehringer Mannheim). The mixture is incubated at 37°C for 30 minutes, then EGTA (ethyleneglycol bis 
(β-aminoethylether)-N,N,N',N' tetraacetic acid) pH 8 is added to a final concentration of 10 mM. The mixture is 
extracted with phenol followed by diethylether and the DNA is precipitated as described above. The 
precipitate is pelleted by centrifugation in a Biofuge A (Heraeus) at 13000 rpm for 10 min at 4°C and the 
pellet is dissolved in H2O to a final DNA concentration of 0.2 ug/ul. 

One ul of this vector DNA is mixed with the 7 ul solution containing the DNA fragment coding for the 
"P32 antigen fragment" (as defined above) and 1 ul 10 x ligase buffer (0.5 M Tris-Cl pH 7.4, 100 mM MgCl2, 
5 mM ATP, 50 mM DTT (dithiothreitol)) plus 1 ul T4 DNA ligase (1 unit/ul, Boehringer Mannheim) is added. 
The mixture is incubated at 13°C for 6 hours and 5 ul of the mixture is then used for transformation into 
strain DH1 (lambda) using standard transformation techniques as described for instance by Maniatis et al. in 
"Molecular cloning, a laboratory manual", Cold Spring Harbor Laboratory (1982). 

Individual transformants are grown and then lysed for plasmid DNA preparation using standard 
procedures (Experiments with gene fusions, Cold Spring Harbor Laboratory 1984 (T.J. Silhavy, M.L. Berman 
and L.W. Enquist eds.)) and are checked for the correct orientation of the gene within the plasmid by 
restriction enzyme analysis. 

One of the clones containing the DNA sequence encoding the antigen fragment in the correct 
orientation was retained for further work and designated pmTNF-MPH-Mt32. It encodes all information of the 
P32 antigen starting from position +4 in the amino acid sequence (see fig. 5). The amino acid sequence of 
the total fusion protein is represented in fig. 13. 

EXAMPLE VI: INDUCTION OF ANTIGEN EXPRESSION FROM pmTNF MPH Mt32 : 



A. MATERIAL AND METHODS 

DNA of pmTNF-MPH-Mt32 is transformed into E. coli strain K12ΔH (ATCC 33767) using standard 
transformation procedures except that the growth temperature of the cultures is reduced to 28°C and the 
heat shock temperature to 34°C. 

A culture of K12ΔH harboring pmTNF-MPH-Mt32, grown overnight in Luria broth at 28°C with vigorous 
shaking in the presence of 10 ug/ml tetracycline, is inoculated into fresh Luria broth containing tetracycline 
(10 ug/ml) and grown to an optical density at 600 nanometers of 0.2 in the same conditions as for the 
overnight culture. 

When the optical density at 600 nanometers has reached 0.2, half of the culture is shifted to 42°C to 
induce expression while the other half remains at 28°C as a control. At several time intervals aliquots are 
taken which are extracted with one volume of phenol equilibrated against M9 salts (0.1% ammonium 
chloride, 0.3% potassium dihydrogen phosphate, 1.5% disodium hydrogen phosphate with 12 molecules 
of water) and 1% SDS. At the same time, the optical density (600 nm) of the culture is checked. The 
proteins are precipitated from the phenol phase by addition of two volumes of acetone and storage 
overnight at -20°C. The precipitate is pelleted (Biofuge A, 5 min, 13000 rpm, room temperature), dried in 
air, dissolved in a volume of Laemmli (Nature (1970) 227:680) sample buffer (+ β-mercaptoethanol) 
proportional to the optical density and boiled for 3 min. 

Samples are then run on a SDS polyacrylamide gel (15%) according to Laemmli (1970). Temperature 
induction of mTNF-His6-P32 is monitored by both Coomassie Brilliant Blue (CBB) staining and immunoblotting. 
CBB staining is performed by immersing the gel in a 1/10 diluted CBB staining solution (0.5 g CBB-R250 
(Serva) in 90 ml methanol : H2O (1:1 v/v) and 10 ml glacial acetic acid) and leaving it for about one hour on 
a gently rotating platform. After destaining for a few hours in destaining solution (30% methanol, 7% glacial 
acetic acid) protein bands are visualised and can be scanned with a densitometer (Ultroscan XL Enhanced 
Laser Densitometer, LKB). 

For immunoblotting the proteins are blotted onto Hybond C membranes (Amersham) as described by 
Towbin et al (1979). After blotting, proteins on the membrane are temporarily visualised with Ponceau S 
(Serva) and the position of the molecular weight markers is indicated. The stain is then removed by washing 
in H2O. Nonspecific protein binding sites are blocked by incubating the blots in 10% non-fat dried milk for 
about 1 hour on a gently rotating platform. After washing twice with NT buffer (25 mM Tris-HCl, pH 8.0; 150 
mM NaCl) blots are incubated with polyclonal rabbit anti-32-kDa antiserum (1:1000), obtained as described 
in example I ("screening of the λgt11 M. tuberculosis recombinant DNA library with anti-32-kDa antiserum") 
in the presence of E. coli lysate or with monoclonal anti-hTNF-antibody which crossreacts with mTNF 
(Innogenetics, n° 17F5D10) for at least 2 hours on a rotating platform. After washing twice with NT buffer + 
0.02% Triton X-100, blots are incubated for at least 1 hour with the secondary antiserum : alkaline 
phosphatase-conjugated swine anti-rabbit immunoglobulins (1/500; Prosan) in the first case, and alkaline 
phosphatase conjugated rabbit anti-mouse immunoglobulins (1/500; Sigma) in the second case. 

Blots are washed again twice with NT buffer + 0.02% Triton X-100 and visualisation is then performed 
with nitro blue tetrazolium (NBT) and 5-bromo-4-chloro-3-indolyl-phosphate (BCIP) from Promega using 
conditions recommended by the supplier. 

B. RESULTS 

Upon induction of K12ΔH cells containing pmTNF-MPH-Mt32, a clearly visible band of about 35 kDa 
appears on CBB stained gels, already one hour after start of induction (Fig. 14a). This band, corresponding 
to roughly 25% of total protein contents of the cell, reacts strongly with anti-32-kDa and anti-mTNF antisera 
on immunoblots (Fig. 14b). However, this band represents a cleavage product of the original fusion protein, 
since a minor band, around 37 kDa, is also visible on immunoblots, reacting specifically with both antisera 
as well. This suggests that extensive cleavage of the recombinant mTNF-His6-P32 takes place about 2-3 
kDa from its carboxyterminal end. 
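
To give a feeling for what a 2-3 kDa truncation means in terms of sequence, the mass difference can be converted into an approximate number of residues. The sketch below uses the common rule-of-thumb average of about 110 Da per amino acid residue, which is an assumption introduced here and not a value from this document; the function name is hypothetical.

```python
AVG_RESIDUE_MASS_DA = 110.0  # rule-of-thumb average; an assumption, not from the text


def residues_for_mass(mass_kda: float) -> int:
    """Approximate number of residues corresponding to a mass in kDa."""
    return round(mass_kda * 1000.0 / AVG_RESIDUE_MASS_DA)


# Apparent sizes from the text: about 37 kDa (intact) and about 35 kDa (cleaved).
print(residues_for_mass(2.0))  # 18
print(residues_for_mass(3.0))  # 27
```

On this rough estimate the cleavage would lie some 18 to 27 residues from the carboxy terminus, bearing in mind that band sizes read from SDS-PAGE are themselves approximate.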
EXAMPLE VII : PURIFICATION OF RECOMBINANT ANTIGEN ON IMMOBILIZED METAL ION AFFINITY 
CHROMATOGRAPHY (IMAC) 

The hybrid protein mTNF-His6-P32 (amino acid sequence, see fig. 13) expressed by K12ΔH cells 
containing pmTNF-MPH-Mt32 is especially designed to facilitate purification by IMAC, since the 6 
successive histidines in the polylinker sequence bring about a strong affinity for metal ions (HOCHULI et al, 
1988). 



a. Preparation of the crude cell extract : 

12 l of E. coli cells K12ΔH containing plasmid pmTNF-MPH-Mt32 were grown in Luria Broth containing 
tetracycline (10 ug/ml) at 28°C to an optical density (600 nm) of 0.2 and then induced by shifting the 
temperature to 42°C. After 3 hours of induction, cells were harvested by centrifugation (Beckman, JA 10 
rotor, 7,500 rpm, 10 min). The cell paste was resuspended in lysis buffer (10 mM KCl, 10 mM Tris-HCl pH 
6.8, 5 mM EDTA) to a final concentration of 50% (w/v) cells. 

6-NH2-capronic acid and dithiothreitol (DTT) were added to a final concentration of 20 mM and 1 
mM, respectively, to prevent proteolytic degradation. This concentrated cell suspension was stored overnight at -70°C. 

Cells were lysed by passing them three times through a French press (SLM-Aminco) at a working 
pressure of 800-1000 psi. During and after lysis, cells were kept systematically on ice. 

The cell lysate was cleared by centrifugation (Beckman, JA 20, 18,000 rpm, 20 min, 4°C). The 
supernatant (SN) was carefully taken off and the pellet, containing membranes and inclusion bodies, was 
kept for further work since preliminary experiments had shown that the protein was mainly localised in the 
membrane fraction. 

7 M guanidinium hydrochloride (GuHCl, marketed by ICN) in 100 mM phosphate buffer pH 7.2 was 
added to the pellet volume to a final concentration of 6 M GuHCl. The pellet was resuspended and 
extracted in a Dounce tissue homogenizer (10 cycles). 
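
The volume of 7 M GuHCl needed to bring a pellet to 6 M follows from a simple mass balance. The sketch below is an illustration under stated assumptions: the pellet is treated as a GuHCl-free volume, volumes are taken as additive, the function name is hypothetical, and the 15 ml pellet volume in the example is an invented figure, since the text does not give the pellet volume.

```python
def stock_volume_for_final(pellet_ml: float, stock_M: float, final_M: float) -> float:
    """Volume of stock (ml) to add to a denaturant-free pellet to reach final_M.

    Solves stock_M * v = final_M * (pellet_ml + v) for v.
    """
    if stock_M <= final_M:
        raise ValueError("stock must be more concentrated than the target")
    return final_M * pellet_ml / (stock_M - final_M)


# Hypothetical 15 ml pellet; 7 M stock and 6 M target are from the text.
print(stock_volume_for_final(15.0, 7.0, 6.0))  # 90.0
```

With a 7 M stock and a 6 M target, six pellet volumes of stock are required whatever the pellet size.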

After clearing (Beckman, JA 20, 18,000 rpm, 20 min, 4°C), about 100 ml of supernatant was collected 
(= extract 1) and the remaining pellet was extracted again as described above (= extract 2, 40 ml). 

The different fractions (SN, EX1, EX2) were analysed on SDS-PAGE (Laemmli, Nature 1970; 227:680) 
together with control samples of the induced culture. Scanning of the gel revealed that the recombinant 
protein makes up roughly 25% of the total protein content of the induced cell culture. After fractionation 
most of the protein was recovered in the extracts. No difference was noticed between reducing and non-reducing 
conditions (plus and minus β-mercaptoethanol). 

b. Preparation of the Ni++ IDA (iminodiacetic acid) column : 

5 ml of the chelating gel, Chelating Sepharose 6B (Pharmacia), is washed extensively with water to 
remove the ethanol in which it is stored and then packed in an "Econo-column" (1 x 10 cm, Biorad). The top 
of the column is connected with the incoming fluid (sample, buffer, etc) while the end goes to the UV280 
detector via a peristaltic pump. Fractions are collected using a fraction collector and, when appropriate, the pH 
of the fractions is measured manually. 

The column is loaded with Ni++ (6 ml NiCl2.6H2O; 5 ug/ul) and equilibrated with starting buffer (6 M 
guanidinium hydrochloride, 100 mM phosphate buffer, pH 7.2). 

After having applied the sample, the column is washed extensively with starting buffer to remove 
unbound material. 

To elute the bound material, 2 different elution procedures are feasible : 

1) elution by decreasing pH. 

2) elution by increasing imidazol concentration. 
Both will be discussed here. 

To regenerate the column, which has to be done after every 2-3 runs, 20 ml (about 5 column volumes) 
of the following solutions are pumped successively through the column : 

- 0.05 M EDTA - 0.5 M NaCl 

- 0.1 M NaOH 

- H2O 

- 6 ml NiCl2.6H2O (5 mg/ml). 

After equilibrating with starting buffer the column is ready to use again. 

c. Chromatography : 

All buffers contained 6 M guanidinium hydrochloride throughout the chromatography. The column was 
developed at a flow rate of 0.5 ml/min at ambient temperature. Fractions of 2 ml were collected and, when 
appropriate, further analysed by SDS-PAGE and immunoblotting. Gels were stained with Coomassie Brilliant 
Blue R250 and silver stain, as described by ANSORGE (1985). Immunoblotting was carried out as 
described in example I. The primary antiserum used was either polyclonal anti-32kDa-antiserum (1/1000) 
obtained as described in example I ("screening of the λgt11 M. tuberculosis recombinant DNA library with 
anti-32kDa-antiserum") or anti-E. coli immunoglobulins (1/500; PROSAN), or monoclonal anti-hTNF-antibody 
which cross-reacts with mTNF (Innogenetics, 17F5D10). The secondary antiserum was alkaline 
phosphatase conjugated swine anti-rabbit immunoglobulins (1/500, PROSAN), or alkaline phosphatase 
conjugated rabbit anti-mouse immunoglobulins (1/500, Sigma). 

C1. Elution with decreasing pH : 

Solutions used : 
A : 6 M GuHCl 100 mM phosphate pH 7.2 
B : 6 M GuHCl 25 mM phosphate pH 7.2 
C : 6 M GuHCl 50 mM phosphate pH 4.2 

After applying 3 ml of extract 1 (OD280 = 32.0) and extensively washing with solution A, the column is 
equilibrated with solution B and then developed with a linear pH gradient from 7.2 to 4.2 (25 ml of solution 
B and 25 ml of solution C were mixed in a gradient former). The elution profile is shown in figure 15. 

From SDS-PAGE analysis (Coomassie and silver stain) it was clear that most of the originally bound 
recombinant protein was eluted in the fractions between pH 5.3 and 4.7. 

Screening of these fractions on immunoblot with anti-32-kDa and the 17F5D10 monoclonal antibody 
showed that, together with the intact recombinant protein, also some degradation products and higher 
aggregation forms of the protein were present, although in much lower amount. Blotting with anti-E. coli 
antibody revealed that these fractions (pH 5.3-4.7) still contained immunodetectable contaminating E. coli 
proteins (75, 65, 43, 35 and 31 kDa bands) and lipopolysaccharides. 



C2. Elution with increasing imidazol concentration : 

Solutions used : 
A : 6 M GuHCl 100 mM phosphate pH 7.2 
B : 6 M GuHCl 50 mM imidazol pH 7.2 
C : 6 M GuHCl 100 mM imidazol pH 7.2 
D : 6 M GuHCl 15 mM imidazol pH 7.2 
E : 6 M GuHCl 25 mM imidazol pH 7.2 
F : 6 M GuHCl 35 mM imidazol pH 7.2 

Sample application and washing was carried out as in C1, except that after washing, no equilibration was 
necessary with 6 M GuHCl 25 mM phosphate. The column was first developed with a linear gradient of 
imidazol going from 0 to 50 mM (25 ml of solution A and 25 ml of solution B were mixed in a gradient 
former) followed by a step elution to 100 mM imidazol (solution C). During the linear gradient, proteins were 
gradually eluted in a broad smear, while the step to 100 mM gave rise to a clear peak (fig. 16). 
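
For orientation, the nominal imidazol concentration at any point of the linear gradient can be estimated from the delivered volume. The sketch below assumes an ideal gradient former (concentration rising linearly over the 50 ml total), ignores column and tubing dead volume, and counts 2 ml fractions from the start of the gradient; these are simplifying assumptions, and the fraction numbering in the text may be counted differently. The function names are hypothetical.

```python
def gradient_concentration(volume_ml: float, c_start: float = 0.0,
                           c_end: float = 50.0, total_ml: float = 50.0) -> float:
    """Nominal concentration (mM) after volume_ml of an ideal linear gradient."""
    if not 0.0 <= volume_ml <= total_ml:
        raise ValueError("volume lies outside the gradient")
    return c_start + (c_end - c_start) * volume_ml / total_ml


def fraction_midpoint_concentration(fraction_no: int, fraction_ml: float = 2.0) -> float:
    """Nominal concentration at the midpoint of a fraction counted from gradient start."""
    return gradient_concentration((fraction_no - 0.5) * fraction_ml)


print(gradient_concentration(25.0))          # 25.0 mM halfway through the gradient
print(fraction_midpoint_concentration(1))    # 1.0 mM
print(fraction_midpoint_concentration(25))   # 49.0 mM
```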

SDS-PAGE analysis of the fractions revealed that in the first part of the linear gradient (fr 1-24) most 
contaminating E. coli proteins were washed out, while the latter part of the gradient (fr 25-50) and the 100 
mM peak contained more than 90% of the recombinant protein. 

As in C1, these fractions showed, besides a major band of intact recombinant protein, some minor 
bands of degradation and aggregation products. However, in this case, the region below 24 kDa seemed 
nearly devoid of protein bands, which suggests that fewer degradation products co-elute with the intact 
protein. Also, the same contaminating E. coli proteins were detected by immunoblotting as in C1, although 
the 31-kDa band seems less intense and even absent in some fractions. 

In a second stage, we developed the column with a step gradient of increasing imidazol concentrations. 
After having applied the sample and washed the column, 2 column volumes (about 8 ml) of the following 
solutions were brought successively onto the column : solution D, E, F and finally 4 column volumes of 
solution C. The step gradient resulted in a more concentrated elution profile (fig. 17) which makes it more 
suitable for scaling up purposes. 
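
The step gradient described above can be laid out as a simple volume and time schedule. The sketch below is illustrative: it takes the column volume as 4 ml (from "2 column volumes (about 8 ml)"), uses the 0.5 ml/min flow rate and 2 ml fraction size given in the chromatography section, and ignores dead volume; the names are hypothetical.

```python
COLUMN_VOLUME_ML = 4.0   # derived from "2 column volumes (about 8 ml)"
FLOW_ML_PER_MIN = 0.5    # flow rate stated for the chromatography
FRACTION_ML = 2.0        # fraction size stated for the chromatography

# (solution, imidazol in mM, column volumes applied)
STEPS = [("D", 15, 2), ("E", 25, 2), ("F", 35, 2), ("C", 100, 4)]


def schedule(steps):
    """Return (solution, mM, start_ml, end_ml) for each step, in order."""
    rows, start = [], 0.0
    for name, conc_mM, column_volumes in steps:
        end = start + column_volumes * COLUMN_VOLUME_ML
        rows.append((name, conc_mM, start, end))
        start = end
    return rows


rows = schedule(STEPS)
total_ml = rows[-1][3]
for name, conc_mM, start, end in rows:
    print(f"solution {name}: {conc_mM} mM imidazol, {start:.0f}-{end:.0f} ml")
print(total_ml, total_ml / FLOW_ML_PER_MIN, total_ml / FRACTION_ML)  # 40.0 80.0 20.0
```

Under these assumptions the whole step elution takes about 40 ml, that is roughly 80 min and 20 fractions, compared with 50 ml for the linear gradient alone.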

In conclusion, the mTNF-His6-P32 protein has been purified to at least 90% by IMAC. Further 
purification can be achieved through a combination of the following purification steps : 

- IMAC on chelating superose (Pharmacia) 

- ion exchange chromatography (anion or cation) 

- reversed phase chromatography 

- gel filtration chromatography 

- immunoaffinity chromatography 

- elution from polyacrylamide gel. 

These chromatographic methods are commonly used for protein purification. 
The plasmids of figures 10b, 11b and 12b are new. 

Claims 

1. Recombinant polypeptide containing in its polypeptidic chain, one at least of the following amino acid 
sequences: 

- the one extending from the extremity constituted by amino acid at position (-29) to the extremity 
constituted by amino acid at position (-1) represented on fig. 3a and fig. 3b, or 

- the one extending from the extremity constituted by amino acid at position (12) to the extremity 
constituted by amino acid at position (31) represented on fig. 3a and fig. 3b, or 

- the one extending from the extremity constituted by amino acid at position (36) to the extremity 
constituted by amino acid at position (55) represented on fig. 3a and fig. 3b, or 

- the one extending from the extremity constituted by amino acid at position (77) to the extremity 
constituted by amino acid at position (96) represented on fig. 3a and fig. 3b, or 

- the one extending from the extremity constituted by amino acid at position (101) to the extremity 
constituted by amino acid at position (120) represented on fig. 3a and fig. 3b, or 

- the one extending from the extremity constituted by amino acid at position (175) to the extremity 
constituted by amino acid at position (194) represented on fig. 3a and fig. 3b, or 

- the one extending from the extremity constituted by amino acid at position (211) to the extremity 
constituted by amino acid at position (230) represented on fig. 3a and fig. 3b, or 

- the one extending from the extremity constituted by amino acid at position (275) to the extremity 
constituted by amino acid at position (294) represented on fig. 3a and fig. 3b, 

and the peptidic sequences resulting from the modification by substitution and/or by addition and/or by 
deletion of one or several amino acids in so far as this modification does not alter the following properties : 
the polypeptides react with rabbit polyclonal antiserum raised against the protein of 32-kDa of M. bovis 
BCG culture filtrate, and/or 

react selectively with human sera from tuberculosis patients and particularly patients developing an 
evolutive tuberculosis at an early stage, 

and/or react with the amino acid sequence extending from the extremity constituted by amino acid at 
position (1), to the extremity constituted by amino acid at position (294) represented on fig. 3a and fig. 3b. 

2. Recombinant polypeptide according to claim 1, containing in its polypeptidic chain, one at least of the 
following amino acid sequences: 

- the one extending from the extremity constituted by amino acid at position (-29) to the extremity 
constituted by amino acid at position (-1) represented on fig. 4a and fig. 4b, or 

- the one extending from the extremity constituted by amino acid at position (12) to the extremity 
constituted by amino acid at position (31) represented on fig. 4a and fig. 4b, or 

- the one extending from the extremity constituted by amino acid at position (36) to the extremity 
constituted by amino acid at position (55) represented on fig. 4a and fig. 4b, or 

- the one extending from the extremity constituted by amino acid at position (77) to the extremity 
constituted by amino acid at position (96) represented on fig. 4a and fig. 4b, or 

- the one extending from the extremity constituted by amino acid at position (101) to the extremity 
constituted by amino acid at position (120) represented on fig. 4a and fig. 4b, or 

- the one extending from the extremity constituted by amino acid at position (175) to the extremity 
constituted by amino acid at position (194) represented on fig. 4a and fig. 4b, or 

- the one extending from the extremity constituted by amino acid at position (211) to the extremity 
constituted by amino acid at position (230) represented on fig. 4a and fig. 4b, or 

- the one extending from the extremity constituted by amino acid at position (275) to the extremity 
constituted by amino acid at position (294) represented on fig. 4a and fig. 4b, 

and the peptidic sequences resulting from the modification by substitution and/or by addition and/or by 
deletion of one or several amino acids in so far as this modification does not alter the following properties : 
the polypeptides react with rabbit polyclonal antiserum raised against the protein of 32-kDa of M. bovis 
BCG culture filtrate, and/or 

react selectively with human sera from tuberculosis patients and particularly patients developing an 
evolutive tuberculosis at an early stage, 

and/or react with the amino acid sequence extending from the extremity constituted by amino acid at 
position (1), to the extremity constituted by amino acid at position (294) represented on fig. 4a and fig. 4b. 

3. Recombinant polypeptide according to claim 1, containing in its polypeptidic chain, one at least of the 
following amino acid sequences: 

- the one extending from the extremity constituted by amino acid at position (-30) to the extremity 
constituted by amino acid at position (-1) represented on fig. 5, or 

- the one extending from the extremity constituted by amino acid at position (12) to the extremity 
constituted by amino acid at position (31) represented on fig. 5, or 

- the one extending from the extremity constituted by amino acid at position (36) to the extremity 
constituted by amino acid at position (55) represented on fig. 5, or 

- the one extending from the extremity constituted by amino acid at position (77) to the extremity 
constituted by amino acid at position (96) represented on fig. 5, or 

- the one extending from the extremity constituted by amino acid at position (101) to the extremity 
constituted by amino acid at position (120) represented on fig. 5, or 

- the one extending from the extremity constituted by amino acid at position (175) to the extremity 
constituted by amino acid at position (194) represented on fig. 5, or 

- the one extending from the extremity constituted by amino acid at position (211) to the extremity 
constituted by amino acid at position (230) represented on fig. 5, or 

- the one extending from the extremity constituted by amino acid at position (275) to the extremity 
constituted by amino acid at position (295) represented on fig. 5, 

and the peptidic sequences resulting from the modification by substitution and/or by addition and/or by 
deletion of one or several amino acids in so far as this modification does not alter the following properties : 
the polypeptides react with rabbit polyclonal antiserum raised against the protein of 32-kDa of M. bovis 
BCG culture filtrate, and/or 

react selectively with human sera from tuberculosis patients and particularly patients developing an 
evolutive tuberculosis at an early stage, 

and/or react with the amino acid sequence extending from the extremity constituted by amino acid at 
position (1), to the extremity constituted by amino acid at position (295) represented on fig. 5. 

4. Recombinant polypeptide according to claim 1, containing in its polypeptidic chain, one at least of the 
following amino acid sequences: 

- the one extending from the extremity constituted by amino acid at position (1) to the extremity constituted 
by amino acid at position (294) represented on fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by amino acid at position (-42) to the extremity 
constituted by amino acid at position (-1) represented on fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by amino acid at position (-47) to the extremity 
constituted by amino acid at position (-1) represented on fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by amino acid at position (-49) to to the extremity 
25 constituted by amino acid at position (-1) represented on fig, 3a and fig. 3b. 

- the one extending from the extremity constituted by amino acid at position (-55) to the extremity 
constituted by amino acid at position (-1) represented on fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by amino acid at position (-59) to the extremity 
constituted by amino acid at position (-1) represented on fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by amino acid at position (-29) to the extremity
constituted by amino acid at position (294) represented on fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by amino acid at position (-42) to the extremity 
constituted by amino acid at position (294) represented on fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by amino acid at position (-47) to the extremity 
constituted by amino acid at position (294) represented on fig. 3a and fig. 3b,

- the one extending from the extremity constituted by amino acid at position (-49) to the extremity
constituted by amino acid at position (294) represented on fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by amino acid at position (-55) to the extremity 
constituted by amino acid at position (294) represented on fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by amino acid at position (-59) to the extremity
constituted by amino acid at position (294) represented on fig. 3a and fig. 3b. 

5. Recombinant polypeptide according to claim 2, containing in its polypeptidic chain, one at least of the 
following amino acid sequences: 

- the one extending from the extremity constituted by amino acid at position (1) to the extremity constituted 
by amino acid at position (294) represented on fig. 4a and fig. 4b,

- the one extending from the extremity constituted by amino acid at position (-42) to the extremity 
constituted by amino acid at position (-1) represented on fig. 4a and fig. 4b,

- the one extending from the extremity constituted by amino acid at position (-47) to the extremity 
constituted by amino acid at position (-1) represented on fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by amino acid at position (-49) to the extremity
constituted by amino acid at position (-1) represented on fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by amino acid at position (-55) to the extremity 
constituted by amino acid at position (-1) represented on fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by amino acid at position (-59) to the extremity 
constituted by amino acid at position (-1) represented on fig. 4a and fig. 4b,

- the one extending from the extremity constituted by amino acid at position (-29) to the extremity 
constituted by amino acid at position (294) represented on fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by amino acid at position (-42) to the extremity 
constituted by amino acid at position (294) represented on fig. 4a and fig. 4b.

- the one extending from the extremity constituted by amino acid at position (-47) to the extremity 
constituted by amino acid at position (294) represented on fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by amino acid at position (-49) to the extremity 
constituted by amino acid at position (294) represented on fig. 4a and fig. 4b.

- the one extending from the extremity constituted by amino acid at position (-55) to the extremity 
constituted by amino acid at position (294) represented on fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by amino acid at position (-59) to the extremity 
constituted by amino acid at position (294) represented on fig. 4a and fig. 4b. 

6. Recombinant polypeptide according to claim 3, containing in its polypeptidic chain, one at least of the
following amino acid sequences: 

- the one extending from the extremity constituted by amino acid at position (1) to the extremity constituted 
by amino acid at position (295) represented on fig. 5, 

- the one extending from the extremity constituted by amino acid at position (-43) to the extremity 
constituted by amino acid at position (-1) represented on fig. 5,

- the one extending from the extremity constituted by amino acid at position (-30) to the extremity 
constituted by amino acid at position (295) represented on fig. 5. 

- the one extending from the extremity constituted by amino acid at position (-43) to the extremity 
constituted by amino acid at position (295) represented on fig. 5.

7. Recombinant polypeptide according to claim 1, consisting in one of the following amino acid sequences:

- the one extending from the extremity constituted by amino acid at position (-59) to the extremity 
constituted by amino acid at position (294) represented on fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by amino acid at position (-59) to the extremity 
constituted by amino acid at position (-1) represented on fig. 3a and fig. 3b.

- the one extending from the extremity constituted by amino acid at position (-55) to the extremity
constituted by amino acid at position (294) represented on fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by amino acid at position (-55) to the extremity 
constituted by amino acid at position (-1) represented on fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by amino acid at position (-49) to the extremity 
constituted by amino acid at position (294) represented on fig. 3a and fig. 3b,

- the one extending from the extremity constituted by amino acid at position (-49) to the extremity 
constituted by amino acid at position (-1) represented on fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by amino acid at position (-47) to the extremity 
constituted by amino acid at position (294) represented on fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by amino acid at position (-47) to the extremity
constituted by amino acid at position (-1) represented on fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by amino acid at position (-42) to the extremity 
constituted by amino acid at position (294) represented on fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by amino acid at position (-42) to the extremity 
constituted by amino acid at position (-1) represented on fig. 3a and fig. 3b,

- the one extending from the extremity constituted by amino acid at position (-29) to the extremity 
constituted by amino acid at position (294) represented on fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by amino acid at position (-29) to the extremity 
constituted by amino acid at position (-1) represented on fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by amino acid at position (1) to the extremity constituted
by amino acid at position (294) represented on fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by amino acid at position (12) to the extremity 
constituted by amino acid at position (31) represented on fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by amino acid at position (36) to the extremity 
constituted by amino acid at position (55) represented on fig. 3a and fig. 3b,

- the one extending from the extremity constituted by amino acid at position (77) to the extremity 
constituted by amino acid at position (96) represented on fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by amino acid at position (101) to the extremity 
constituted by amino acid at position (120) represented on fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by amino acid at position (175) to the extremity
constituted by amino acid at position (194) represented on fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by amino acid at position (211) to the extremity 
constituted by amino acid at position (230) represented on fig. 3a and fig. 3b. 
- the one extending from the extremity constituted by amino acid at position (275) to the extremity
constituted by amino acid at position (294) represented on fig. 3a and fig. 3b. 

8. Recombinant polypeptide according to claim 2, consisting in one of the following amino acid sequences:

- the one extending from the extremity constituted by amino acid at position (-59) to the extremity 
constituted by amino acid at position (294) represented on fig. 4a and fig. 4b,

- the one extending from the extremity constituted by amino acid at position (-59) to the extremity 
constituted by amino acid at position (-1) represented on fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by amino acid at position (-55) to the extremity 
constituted by amino acid at position (294) represented on fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by amino acid at position (-55) to the extremity
constituted by amino acid at position (-1) represented on fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by amino acid at position (-49) to the extremity 
constituted by amino acid at position (294) represented on fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by amino acid at position (-49) to the extremity 
constituted by amino acid at position (-1) represented on fig. 4a and fig. 4b.

- the one extending from the extremity constituted by amino acid at position (-47) to the extremity 
constituted by amino acid at position (294) represented on fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by amino acid at position (-47) to the extremity 
constituted by amino acid at position (-1) represented on fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by amino acid at position (-42) to the extremity
constituted by amino acid at position (294) represented on fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by amino acid at position (-42) to the extremity 
constituted by amino acid at position (-1) represented on fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by amino acid at position (-29) to the extremity 
constituted by amino acid at position (294) represented on fig. 4a and fig. 4b,

- the one extending from the extremity constituted by amino acid at position (-29) to the extremity 
constituted by amino acid at position (-1) represented on fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by amino acid at position (1) to the extremity constituted 
by amino acid at position (294) represented on fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by amino acid at position (12) to the extremity
constituted by amino acid at position (31) represented on fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by amino acid at position (36) to the extremity 
constituted by amino acid at position (55) represented on fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by amino acid at position (77) to the extremity 
constituted by amino acid at position (96) represented on fig. 4a and fig. 4b,

- the one extending from the extremity constituted by amino acid at position (101) to the extremity 
constituted by amino acid at position (120) represented on fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by amino acid at position (175) to the extremity 
constituted by amino acid at position (194) represented on fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by amino acid at position (211) to the extremity
constituted by amino acid at position (230) represented on fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by amino acid at position (275) to the extremity 
constituted by amino acid at position (294) represented on fig. 4a and fig. 4b. 

9. Recombinant polypeptide according to claim 3, consisting in one of the following amino acid sequences:

- the one extending from the extremity constituted by amino acid at position (-43) to the extremity
constituted by amino acid at position (295) represented on fig. 5,

- the one extending from the extremity constituted by amino acid at position (-43) to the extremity 
constituted by amino acid at position (-1) represented on fig. 5. 

- the one extending from the extremity constituted by amino acid at position (-30) to the extremity 
constituted by amino acid at position (295) represented on fig. 5.

- the one extending from the extremity constituted by amino acid at position (-30) to the extremity 
constituted by amino acid at position (-1) represented on fig. 5, 

- the one extending from the extremity constituted by amino acid at position (1) to the extremity constituted 
by amino acid at position (295) represented on fig. 5. 

- the one extending from the extremity constituted by amino acid at position (12) to the extremity
constituted by amino acid at position (31) represented on fig. 5, 

- the one extending from the extremity constituted by amino acid at position (36) to the extremity 
constituted by amino acid at position (55) represented on fig. 5, 
- the one extending from the extremity constituted by amino acid at position (77) to the extremity
constituted by amino acid at position (96) represented on fig. 5, 

- the one extending from the extremity constituted by amino acid at position (101) to the extremity 
constituted by amino acid at position (120) represented on fig. 5. 

- the one extending from the extremity constituted by amino acid at position (175) to the extremity
constituted by amino acid at position (194) represented on fig. 5. 

- the one extending from the extremity constituted by amino acid at position (211) to the extremity 
constituted by amino acid at position (230) represented on fig. 5, 

- the one extending from the extremity constituted by amino acid at position (275) to the extremity 
constituted by amino acid at position (295) represented on fig. 5.

10. Amino acid sequences constituted by a polypeptide according to claims 1 to 9, and a protein or a
heterologous sequence with respect to said polypeptide, said protein or heterologous sequence comprising
from about 1 to about 1000 amino acids. 

11. Amino acid sequence according to claim 10, wherein the heterologous protein is β-galactosidase.

12. Nucleic acid comprising

- a nucleotide sequence coding for any one of the polypeptides according to claims 1 to 11,

- or nucleotide sequences which hybridize with the nucleotide sequences coding for any one of the
polypeptides according to claims 1 to 11, 

- or nucleotide sequences which are complementary to the nucleotide sequences coding for any of the 
polypeptides according to claims 1 to 11,

- the above mentioned nucleotide sequences wherein T can be replaced by U. 

13. Nucleic acid according to claim 12, comprising one at least of the following nucleotide sequences: 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (182) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (273) to the extremity
constituted by nucleotide at position (359) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (360) to the extremity 
constituted by nucleotide at position (1241) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (1242) to the extremity 
constituted by nucleotide at position (1358), wherein N represents one of the five A, T, C, G or I
nucleotides, represented in fig. 3a and fig. 3b,

or above said nucleotide sequences wherein T is replaced by U, 

or nucleic acids which hybridize with said above mentioned nucleotide sequences or the complementary 
sequences thereof. 

14. Nucleic acid according to claim 13, comprising one at least of the following nucleotide sequences:

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (182) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position (273) to the extremity 
constituted by nucleotide at position (359) represented in fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by nucleotide at position (360) to the extremity
constituted by nucleotide at position (1241) represented in fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by nucleotide at position (1242) to the extremity 
constituted by nucleotide at position (1358), wherein N represents one of the five A, T, C, G or I
nucleotides, represented in fig. 4a and fig. 4b. 

or above said nucleotide sequences wherein T is replaced by U,

or nucleic acids which hybridize with said above mentioned nucleotide sequences or the complementary 
sequences thereof. 

15. Nucleic acid according to claim 13, comprising one at least of the following nucleotide sequences: 

- the one extending from the extremity constituted by nucleotide at position (130) to the extremity 
constituted by nucleotide at position (219) represented in fig. 5,

- the one extending from the extremity constituted by nucleotide at position (220) to the extremity 
constituted by nucleotide at position (1104) represented in fig. 5, 

- the one extending from the extremity constituted by nucleotide at position (1104) to the extremity 
constituted by nucleotide at position (1299). 

or above said nucleotide sequences wherein T is replaced by U,

or nucleic acids which hybridize with said above mentioned nucleotide sequences or the complementary 
sequences thereof. 

16. Nucleic acid according to claim 13, comprising one of the following sequences: 
- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted
by nucleotide at position (194) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (195) to the extremity 
constituted by nucleotide at position (359) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (212) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (213) to the extremity 
constituted by nucleotide at position (359) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (218) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (219) to the extremity 
constituted by nucleotide at position (359) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (272) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (359) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (183) to the extremity 
constituted by nucleotide at position (359) represented in fig. 3a and fig. 3b,

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (1358) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (1241) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (183) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (183) to the extremity 
constituted by nucleotide at position (1241) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (195) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (195) to the extremity 
constituted by nucleotide at position (1241) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (213) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (213) to the extremity 
constituted by nucleotide at position (1241) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (219) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (219) to the extremity 
constituted by nucleotide at position (1241) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (234) to the extremity 
constituted by nucleotide at position (359) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (234) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (234) to the extremity 
constituted by nucleotide at position (1241) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (273) to the extremity 
constituted by nucleotide at position (1241) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (273) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (360) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 3a and fig. 3b. 

17. Nucleic acid according to claim 14, comprising one of the following sequences:

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (194) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position (195) to the extremity 
constituted by nucleotide at position (359) represented in fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted
by nucleotide at position (212) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position (213) to the extremity 
constituted by nucleotide at position (359) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (218) represented in fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by nucleotide at position (219) to the extremity 
constituted by nucleotide at position (359) represented in fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (272) represented in fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (359) represented in fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by nucleotide at position (183) to the extremity 
constituted by nucleotide at position (359) represented in fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (1358) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (1241) represented in fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by nucleotide at position (183) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position (183) to the extremity 
constituted by nucleotide at position (1241) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position (195) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by nucleotide at position (195) to the extremity 
constituted by nucleotide at position (1241) represented in fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by nucleotide at position (213) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by nucleotide at position (213) to the extremity 
constituted by nucleotide at position (1241) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position (219) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by nucleotide at position (219) to the extremity 
constituted by nucleotide at position (1241) represented in fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by nucleotide at position (234) to the extremity 
constituted by nucleotide at position (359) represented in fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by nucleotide at position (234) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by nucleotide at position (234) to the extremity 
constituted by nucleotide at position (1241) represented in fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by nucleotide at position (273) to the extremity 
constituted by nucleotide at position (1241) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position (273) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 4a and fig. 4b. 

- the one extending from the extremity constituted by nucleotide at position (360) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 4a and fig. 4b. 

18. Nucleic acid according to claim 15, comprising one of the following sequences: 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (129) represented in fig. 5. 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted
by nucleotide at position (219) represented in fig. 5, 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted
by nucleotide at position (1104) represented in fig. 5, 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (1299) represented in fig. 5,

- the one extending from the extremity constituted by nucleotide at position (90) to the extremity constituted 
by nucleotide at position (219) represented in fig. 5, 

- the one extending from the extremity constituted by nucleotide at position (90) to the extremity constituted 
by nucleotide at position (1299) represented in fig. 5. 

- the one extending from the extremity constituted by nucleotide at position (90) to the extremity constituted 
by nucleotide at position (1104) represented in fig. 5, 
- the one extending from the extremity constituted by nucleotide at position (130) to the extremity
constituted by nucleotide at position (1104) represented in fig. 5. 

- the one extending from the extremity constituted by nucleotide at position (130) to the extremity 
constituted by nucleotide at position (1299) represented in fig. 5,

- the one extending from the extremity constituted by nucleotide at position (220) to the extremity 
constituted by nucleotide at position (1299) represented in fig. 5. 

19. Nucleic acid according to claim 13, consisting in one of the following nucleotide sequences:

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (182) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (194) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (212) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (218) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted
by nucleotide at position (272) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted
by nucleotide at position (359) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (1241) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (1358) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (183) to the extremity 
constituted by nucleotide at position (359) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (183) to the extremity 
constituted by nucleotide at position (1241) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (183) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (195) to the extremity 
constituted by nucleotide at position (359) represented in fig. 3a and fig. 3b. 

- the one extending from the extremity constituted by nucleotide at position (195) to the extremity 
constituted by nucleotide at position (1241) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (195) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (213) to the extremity 
constituted by nucleotide at position (359) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (213) to the extremity
constituted by nucleotide at position (1241) represented in fig. 3a and fig. 3b,

- the one extending from the extremity constituted by nucleotide at position (213) to the extremity
constituted by nucleotide at position (1358) represented in fig. 3a and fig. 3b,

- the one extending from the extremity constituted by nucleotide at position (219) to the extremity
constituted by nucleotide at position (359) represented in fig. 3a and fig. 3b,

- the one extending from the extremity constituted by nucleotide at position (219) to the extremity
constituted by nucleotide at position (1241) represented in fig. 3a and fig. 3b,

- the one extending from the extremity constituted by nucleotide at position (219) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (234) to the extremity 
constituted by nucleotide at position (359) represented in fig. 3a and fig. 3b, 

- the one extending from the extremity constituted by nucleotide at position (234) to the extremity
constituted by nucleotide at position (1241) represented in fig. 3a and fig. 3b,

- the one extending from the extremity constituted by nucleotide at position (234) to the extremity
constituted by nucleotide at position (1358) represented in fig. 3a and fig. 3b,

- the one extending from the extremity constituted by nucleotide at position (273) to the extremity
constituted by nucleotide at position (359) represented in fig. 3a and fig. 3b,

- the one extending from the extremity constituted by nucleotide at position (273) to the extremity
constituted by nucleotide at position (1241) represented in fig. 3a and fig. 3b,

- the one extending from the extremity constituted by nucleotide at position (273) to the extremity
constituted by nucleotide at position (1358) represented in fig. 3a and fig. 3b,

- the one extending from the extremity constituted by nucleotide at position (360) to the extremity
constituted by nucleotide at position (1241) represented in fig. 3a and fig. 3b,

- the one extending from the extremity constituted by nucleotide at position (360) to the extremity
constituted by nucleotide at position (1358) represented in fig. 3a and fig. 3b,

- the one extending from the extremity constituted by nucleotide at position (1242) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 3a and fig. 3b. 

20. Nucleic acid according to claim 14, consisting in one of the following nucleotide sequences:

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted
by nucleotide at position (182) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (194) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (212) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted
by nucleotide at position (218) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted
by nucleotide at position (272) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted
by nucleotide at position (359) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted
by nucleotide at position (1241) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (1358) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position (183) to the extremity
constituted by nucleotide at position (359) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position (183) to the extremity
constituted by nucleotide at position (1241) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position (183) to the extremity
constituted by nucleotide at position (1358) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position (195) to the extremity 
constituted by nucleotide at position (359) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position (195) to the extremity
constituted by nucleotide at position (1241) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position (195) to the extremity
constituted by nucleotide at position (1358) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position (213) to the extremity
constituted by nucleotide at position (359) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position (213) to the extremity
constituted by nucleotide at position (1241) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position (213) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position (219) to the extremity 
constituted by nucleotide at position (359) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position (219) to the extremity
constituted by nucleotide at position (1241) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position (219) to the extremity
constituted by nucleotide at position (1358) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position (234) to the extremity
constituted by nucleotide at position (359) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position (234) to the extremity
constituted by nucleotide at position (1241) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position (234) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position (273) to the extremity
constituted by nucleotide at position (359) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position (273) to the extremity
constituted by nucleotide at position (1241) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position (273) to the extremity
constituted by nucleotide at position (1358) represented in fig. 4a and fig. 4b,

- the one extending from the extremity constituted by nucleotide at position (360) to the extremity 
constituted by nucleotide at position (1241) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position (360) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 4a and fig. 4b, 

- the one extending from the extremity constituted by nucleotide at position (1242) to the extremity 
constituted by nucleotide at position (1358) represented in fig. 4a and fig. 4b. 

21. Nucleic acid according to claim 15, consisting in one of the following nucleotide sequences: 

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted 
by nucleotide at position (129) represented in fig. 5,

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted
by nucleotide at position (219) represented in fig. 5,

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted
by nucleotide at position (1104) represented in fig. 5,

- the one extending from the extremity constituted by nucleotide at position (1) to the extremity constituted
by nucleotide at position (1299) represented in fig. 5,

- the one extending from the extremity constituted by nucleotide at position (90) to the extremity constituted 
by nucleotide at position (219) represented in fig. 5, 

- the one extending from the extremity constituted by nucleotide at position (90) to the extremity constituted 
by nucleotide at position (1104) represented in fig. 5, 

- the one extending from the extremity constituted by nucleotide at position (90) to the extremity constituted 
by nucleotide at position (1299) represented in fig. 5,

- the one extending from the extremity constituted by nucleotide at position (130) to the extremity
constituted by nucleotide at position (219) represented in fig. 5,

- the one extending from the extremity constituted by nucleotide at position (130) to the extremity
constituted by nucleotide at position (1104) represented in fig. 5,

- the one extending from the extremity constituted by nucleotide at position (130) to the extremity 
constituted by nucleotide at position (1299) represented in fig. 5, 

- the one extending from the extremity constituted by nucleotide at position (220) to the extremity 
constituted by nucleotide at position (1104) represented in fig. 5, 

- the one extending from the extremity constituted by nucleotide at position (220) to the extremity 
constituted by nucleotide at position (1299) represented in fig. 5,

- the one extending from the extremity constituted by nucleotide at position (1104) to the extremity 
constituted by nucleotide at position (1299) represented in fig. 5. 
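
As an illustration only, the fragment definitions used in claims 19 to 21 can be read as 1-based nucleotide positions with both extremities included. The Python sketch below assumes that reading; the function name `fragment` and the toy sequence are hypothetical and are not the sequences of fig. 3a, 3b, 4a, 4b or 5.

```python
def fragment(sequence: str, start: int, end: int) -> str:
    """Return the nucleotides from position start to position end.

    Positions are 1-based and both extremities are included (assumed
    reading of "from the extremity constituted by nucleotide at
    position (x) to the extremity constituted by nucleotide at
    position (y)").
    """
    if not 1 <= start <= end <= len(sequence):
        raise ValueError(f"positions ({start}) to ({end}) fall outside the sequence")
    # Python slices are 0-based and exclude the upper bound.
    return sequence[start - 1:end]


# Toy sequence for illustration only, not taken from the figures.
toy = "ATGCAGCTTGTTGACAGG"
print(fragment(toy, 1, 3))  # ATG
print(fragment(toy, 4, 9))  # CAGCTT
```

Under this reading a fragment from position (x) to position (y) always contains y - x + 1 nucleotides.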

22. Recombinant nucleic acid containing at least one of the nucleotide sequences according to claims 13 to
21, inserted in a heterologous nucleic acid.

23. DNA or RNA primer constituted by one of the following sequences:

A(i) CAGCTTGTTGACAGGGTTCGTGGC

A(ii) GGTTCGTGGCGCCGTCACG

A(iii) CGTCGCGCGCCTAGTGTCGG

A(iv) CGGCGCCGTCGGTGGCACGGCGA

A(v) CGTCGGCGCGGCCCTAGTGTCGG

B TCGCCCGCCCTGTACCTG

C GCGCTGACGCTGGCGATCTATC

D CCGCTGTTGAACGTCGGGAAG

E AAGCCGTCGGATCTGGGTGGCAAC

F(i) ACGGCACTGGGTGCCACGCCCAAC

F(ii) ACGCCCAACACCGGGCCCGCCGCA

F(iii) ACGGGCACTGGGTGCCACGCCCAAC

F(iv) ACGCCCCAACACCGGGCCCGCGCCCCA

24. DNA or RNA primer set constituted by any of the nucleotide sequences A(i), A(ii), A(iii), A(iv), A(v), B, C,
D, E, F(i), F(ii), F(iii) or F(iv) in association with the complement of any other nucleotide sequences chosen
from A, B, C, D, E, or F, A meaning any of the sequences A(i), A(ii), A(iii), A(iv), A(v) and F any of the
sequences F(i), F(ii), F(iii) and F(iv),

A(i), A(ii), A(iii), A(iv), A(v), B, C, D, E, F(i), F(ii), F(iii) and F(iv) having the meaning of claim 11, and
advantageously constituted by the following elements:

- A(i) or A(ii) or A(iii) or A(iv) or A(v) and the complement of B

- A(i) or A(ii) or A(iii) or A(iv) or A(v) and the complement of C

- B and the complement of C

- A(i) or A(ii) or A(iii) or A(iv) or A(v) and the complement of F

- A(i) or A(ii) or A(iii) or A(iv) or A(v) and the complement of D

- A(i) or A(ii) or A(iii) or A(iv) or A(v) and the complement of E

- B and the complement of D

- B and the complement of E

- B and the complement of F

- C and the complement of D

- C and the complement of E

- C and the complement of F

- D and the complement of E

- D and the complement of F

- E and the complement of F.
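
For illustration, the advantageous primer sets of claim 24 can be enumerated mechanically. The Python sketch below assumes that each listed combination pairs one sequence of an upstream family with the complement of one sequence of a downstream family, the families being ordered A, B, C, D, E, F; the names `FAMILIES` and `advantageous_primer_sets` are hypothetical.

```python
from itertools import combinations, product

# Members of each family as labelled in the claims.
FAMILIES = {
    "A": ["A(i)", "A(ii)", "A(iii)", "A(iv)", "A(v)"],
    "B": ["B"],
    "C": ["C"],
    "D": ["D"],
    "E": ["E"],
    "F": ["F(i)", "F(ii)", "F(iii)", "F(iv)"],
}


def advantageous_primer_sets():
    """List every (primer, complement primer) label pair.

    Assumption: the first primer always comes from a family that
    precedes the family of the complemented primer.
    """
    sets = []
    for upstream, downstream in combinations("ABCDEF", 2):
        for fwd, rev in product(FAMILIES[upstream], FAMILIES[downstream]):
            sets.append((fwd, f"complement of {rev}"))
    return sets


pairs = advantageous_primer_sets()
print(len(pairs))  # 62
print(pairs[0])    # ('A(i)', 'complement of B')
```

The 15 family combinations listed in the claim expand to 62 individual pairs once the five A variants and the four F variants are written out.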



25. Recombinant vector, particularly for cloning and/or expression, comprising a vector sequence, notably of
the type plasmid, cosmid or phage, and a recombinant nucleic acid according to anyone of claims 13 to 21,
in one of the non essential sites for its replication.

26. Recombinant vector according to claim 25, containing in one of its non essential sites for its replication
necessary elements to promote the expression of polypeptides according to anyone of claims 1 to 12 in a
cellular host and possibly a promoter recognized by the polymerase of the cellular host, particularly an
inducible promoter and possibly a signal sequence and/or an anchoring sequence.

27. Recombinant vector according to claim 26, containing the elements enabling the expression by E. coli
of a nucleic acid according to anyone of claims 6 to 9 inserted in the vector, and particularly the elements
enabling the expression of the gene or part thereof of β-galactosidase.

28. Cellular host which is transformed by a recombinant vector according to anyone of claims 25 to 27, and
comprising the regulation elements enabling the expression of the nucleotide sequence coding for the
polypeptide according to anyone of claims 1 to 12 in this host.

29. Cellular host according to claim 28, chosen from among bacteria such as E. coli, transformed by the
vector according to claim 25, or chosen from among eukaryotic organisms, transformed by the vector
according to claim 25.

30. Expression product of a nucleic acid expressed by a transformed cellular host according to anyone of 
claims 28 or 29. 

31. Antibody characterized by the fact that it is directed against a recombinant polypeptide according to 
anyone of claims 1 to 12. 

32. Nucleotidic probes, hybridizing with anyone of the nucleic acids according to claims 13 to 21 or with
their complementary sequences, and particularly the probes chosen among the following nucleotidic
sequences:

Probes A(i), A(ii), A(iii) and A(iv)

A(i) CAGCTTGTTGACAGGGTTCGTGGC

A(ii) GGTTCGTGGCGCCGTCACG

A(iii) CGTCGCGCGCCTAGTGTCGG

A(iv) CGGCGCCGTCGGTGGCACGGCGA

A(v) CGTCGGCGCGGCCCTAGTGTCGG

Probe B

TCGCCCGCCCTGTACCTG

Probe C

GCGCTGACGCTGGCGATCTATC

Probe D

CCGCTGTTGAACGTCGGGAAG

Probe E

AAGCCGTCGGATCTGGGTGGCAAC

Probes F(i) and F(ii)

F(i) ACGGCACTGGGTGCCACGCCCAAC
F(ii) ACGCCCAACACCGGGCCCGCCGCA
F(iii) ACGGGCACTGGGTGCCACGCCCAAC
F(iv) ACGCCCCAACACCGGGCCCGCGCCCCA
or their complementary nucleotidic sequences.

33. Process for preparing a recombinant polypeptide according to anyone of claims 1 to 12 comprising the 
following steps: 

- the culture in an appropriate medium of a cellular host which has previously been transformed by an 
appropriate vector containing a nucleic acid according to anyone of claims 12 to 22, and

- the recovery of the polypeptide produced by the above said transformed cellular host from the above said 
culture medium. 

34. Method for the in vitro diagnostic of tuberculosis in a patient liable to be infected by Mycobacterium 
tuberculosis comprising the following steps: 

- the possible previous amplification of the amount of the nucleotide sequences according to anyone of 
claims 12 to 22, liable to be contained in a biological sample taken from said patient by means of a DNA 
primer set according to claim 24,

- contacting the above mentioned biological sample with a nucleotide probe according to claim 32, under
conditions enabling the production of an hybridization complex formed between said probe and said
nucleotide sequence,

- detecting the above said hybridization complex which has been possibly formed. 

35. Method for the in vitro diagnostic of tuberculosis in a patient liable to be infected by Mycobacterium 
tuberculosis comprising 

- contacting a biological sample taken from a patient with a polypeptide according to anyone of claims 1 to
11, under conditions enabling an in vitro immunological reaction between said polypeptide and the
antibodies which are possibly present in the biological sample and

- the in vitro detection of the antigen/antibody complex which has been possibly formed.

36. Method for the in vitro diagnostic of tuberculosis in a patient liable to be infected by M. tuberculosis,
comprising the following steps:

- contacting the biological sample with an appropriate antibody according to claim 31, under conditions
enabling an in vitro immunological reaction between said antibody and the antigens of M. tuberculosis
which are possibly present in the biological sample and

- the in vitro detection of the antigen/antibody complex which may be formed.

37. Necessary or kit for an in vitro diagnostic method of tuberculosis in a patient liable to be infected by
Mycobacterium tuberculosis according to claim 34, comprising

- a determined amount of a nucleotide probe according to claim 32,

- advantageously the appropriate medium for creating an hybridization reaction between the sequence to be
detected and the above mentioned probe,

- advantageously, reagents enabling the detection of the hybridization complex which has been formed
between the nucleotide sequence and the probe during the hybridization reaction.

38. Necessary or kit for an in vitro diagnostic method of tuberculosis in a patient liable to be infected by
Mycobacterium tuberculosis according to claim 35, comprising

- a polypeptide according to anyone of claims 1 to 11,

- reagents for making a medium appropriate for the immunological reaction to occur,

- reagents enabling to detect the antigen/antibody complex which has been produced by the immunological
reaction, said reagents possibly having a label, or being liable to be recognized by a labeled reagent, more
particularly in the case where the above mentioned polypeptide is not labeled.

39. Necessary or kit for an in vitro diagnostic method of tuberculosis in a patient liable to be infected by
Mycobacterium tuberculosis according to claim 36, comprising

- an antibody according to claim 31,

- reagents for making a medium appropriate for the immunological reaction to occur,

- reagents enabling to detect the antigen/antibody complexes which have been produced by the
immunological reaction, said reagents possibly having a label or being liable to be recognized by a labeled
reagent, more particularly in the case where the above mentioned antibody is not labeled.

40. Immunogenic composition comprising a polypeptide according to anyone of claims 1 to 11, in
association with a pharmaceutically acceptable vehicle.

41. Vaccine composition comprising among other immunogenic principles anyone of the polypeptides
according to claims 1 to 11 or the expression product of claim 30, possibly coupled to a natural protein or
to a synthetic polypeptide having a sufficient molecular weight so that the conjugate is able to induce in
vivo the production of antibodies neutralizing Mycobacterium tuberculosis, or induce in vivo a cellular
immune response by activating M. tuberculosis antigen-responsive T cells.

42. Process for the enzymatic amplification of a nucleotide sequence according to claims 12 to 22, and
detection of the amplified nucleotide sequence, wherein

- the amplification is achieved by PCR technique by means of a primer set and the detection of the PCR
amplified product is achieved by a hybridization reaction with a detection probe constituted by an
oligonucleotide sequence of at least 10 nucleotides, said sequence being located between the two PCR
primers which have been used for amplifying said nucleotide sequence,

- the primer set and detection probe used being preferably chosen among the following elements:

Primer set
P1 GAGTACCTGCAGGTGCCGTCGCCGTCGATGGGCCG
P2 compl. GTACCACTCGAACGCCGGGGTGTTGAT
Probe B
TCGCCCGCCCTGTACCTG

Primer set
P1 GAGTACCTGCAGGTGCCGTCGCCGTCGATGGGCCG
P3 compl. TCCCACTTGTAAGTCTGGCA
Probe B
TCGCCCGCCCTGTACCTG

Primer set
P1 GAGTACCTGCAGGTGCCGTCGCCGTCGATGGGCCG
P4 compl. CGGCAGCTCGCTGGTCAGGA
Probe B
TCGCCCGCCCTGTACCTG

Primer set
P1 GAGTACCTGCAGGTGCCGTCGCCGTCGATGGGCCG
P5 compl. GCGTCACCCATCGCCAGGCCGATCAGG
Probe B
TCGCCCGCCCTGTACCTG or
Probe C
GCGCTGACGCTGGCGATCTATC

Primer set
P1 GAGTACCTGCAGGTGCCGTCGCCGTCGATGGGCCG
P6 compl. GCGCCCCAGTACTCCCAGCTGTGCGT
Probe B
TCGCCCGCCCTGTACCTG or
Probe C
GCGCTGACGCTGGCGATCTATC or
Probe D
CCGCTGTTGAACGTCGGGAAG or
Probe E
AAGCCGTCGGATCTGGGTGGCAAC

Primer set
P2 ATCAACACCCCGGCGTTCGAGTGGTAC
P5 compl. GCGTCACCCATCGCCAGGCCGATCAGG
Probe C
GCGCTGACGCTGGCGATCTATC

Primer set
P2 ATCAACACCCCGGCGTTCGAGTGGTAC
P6 compl. GCGCCCCAGTACTCCCAGCTGTGCGT
Probe C
GCGCTGACGCTGGCGATCTATC or
Probe D
CCGCTGTTGAACGTCGGGAAG or
Probe E
AAGCCGTCGGATCTGGGTGGCAAC

Primer set
P3 TGCCAGACTTACAAGTGGGA
P5 compl. GCGTCACCCATCGCCAGGCCGATCAGG
Probe C
GCGCTGACGCTGGCGATCTATC

Primer set
P3 TGCCAGACTTACAAGTGGGA
P6 compl. GCGCCCCAGTACTCCCAGCTGTGCGT
Probe C
GCGCTGACGCTGGCGATCTATC or
Probe D
CCGCTGTTGAACGTCGGGAAG or
Probe E
AAGCCGTCGGATCTGGGTGGCAAC

Primer set
P4 TCCTGACCAGCGAGCTGCCG
P5 compl. GCGTCACCCATCGCCAGGCCGATCAGG
Probe C
GCGCTGACGCTGGCGATCTATC

Primer set
P4 TCCTGACCAGCGAGCTGCCG
P6 compl. GCGCCCCAGTACTCCCAGCTGTGCGT
Probe C
GCGCTGACGCTGGCGATCTATC or
Probe D
CCGCTGTTGAACGTCGGGAAG or
Probe E
AAGCCGTCGGATCTGGGTGGCAAC

Primer set
P5 CCTGATCGGCCTGGCGATGGGTGACGC
P6 compl. GCGCCCCAGTACTCCCAGCTGTGCGT
Probe D
CCGCTGTTGAACGTCGGGAAG or
Probe E
AAGCCGTCGGATCTGGGTGGCAAC

or the primer set being preferably chosen among the primer sets according to claim 24, and the detection
probe being constituted by any oligonucleotide sequence of at least 10 nucleotides, said sequence being
located between the two PCR primers constituting the primer set which has been used for amplifying said
nucleotide sequence.
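
The requirement that the detection probe be located between the two PCR primers can be sketched as a simple string check. The Python example below is a minimal illustration under stated assumptions: the "compl." primer is taken to be the reverse complement of the corresponding sequence on the template strand, the template is a synthetic construct with arbitrary spacers rather than a sequence from the figures, and the function names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def probe_between_primers(template, forward, reverse_compl, probe, min_len=10):
    """Check that the probe lies between the two primer sites on the template."""
    start = template.find(forward)
    if start < 0:
        return False
    inner_start = start + len(forward)
    # The "compl." primer anneals to the opposite strand, so on the
    # template strand its site reads as its reverse complement.
    site = reverse_complement(reverse_compl)
    inner_end = template.find(site, inner_start)
    if inner_end < 0:
        return False
    return len(probe) >= min_len and probe in template[inner_start:inner_end]


P1 = "GAGTACCTGCAGGTGCCGTCGCCGTCGATGGGCCG"
P2 = "ATCAACACCCCGGCGTTCGAGTGGTAC"
PROBE_B = "TCGCCCGCCCTGTACCTG"
PROBE_C = "GCGCTGACGCTGGCGATCTATC"

# Synthetic template: the spacers "TTT", "ACGT" and "AAA" are arbitrary.
template = "TTT" + P1 + "ACGT" + PROBE_B + "ACGT" + P2 + "AAA"
P2_COMPL = reverse_complement(P2)

print(probe_between_primers(template, P1, P2_COMPL, PROBE_B))  # True
print(probe_between_primers(template, P1, P2_COMPL, PROBE_C))  # False
```

In this toy construct probe B is accepted for the set P1 and P2 compl., while probe C is rejected because it does not occur between the two primer sites.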

43. A vector sequence forming part of a recombinant vector according to claim 25, said vector sequence
having either the nucleic acid sequence represented in fig. 10b, or the nucleic acid sequence represented
in fig. 11b, or the nucleic acid sequence represented in fig. 12b.

44. Plasmids comprising either the nucleic acid sequence of fig. 10b, or the nucleic acid sequence of fig.
11b, or the nucleic acid sequence of fig. 12b.

45. Peptides of claim 1, advantageously used to produce antibodies, particularly monoclonal antibodies and
which have the following amino acid sequences:

Amino acid position                            Amino acid position
(NH2-terminal)        Sequence                 (COOH-terminal)

12                    QVPSPSMGRDIKVQFQSGGA     31
36                    LYLLDGLRAQDDFSGWDINT     55
77                    SFYSDWYQPACRKAGCQTYK     96
101                   LTSELPGWLQANRHVKPTGS     120
175                   KASDMWGPKEDPAWQRNDPL     194
211                   CGNGKPSDLGGNNLPAKFLE     230
275                   KPDLQRHWVPRPTPGPPQGA     294
77                    SFYSDWYQPACGKAGCQTYK     96
276                   PDLQRALGATPNTGPAPQGA     299
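
As a hedged aid for reading the peptide table of claim 45, the Python sketch below compares the number of residues of each peptide with the span implied by its NH2-terminal and COOH-terminal positions. It assumes that both positions are included in the peptide; the names `PEPTIDES` and `span_mismatches` are hypothetical, and the values are taken as they appear in the table.

```python
# (NH2-terminal position, sequence, COOH-terminal position), as listed in claim 45.
PEPTIDES = [
    (12, "QVPSPSMGRDIKVQFQSGGA", 31),
    (36, "LYLLDGLRAQDDFSGWDINT", 55),
    (77, "SFYSDWYQPACRKAGCQTYK", 96),
    (101, "LTSELPGWLQANRHVKPTGS", 120),
    (175, "KASDMWGPKEDPAWQRNDPL", 194),
    (211, "CGNGKPSDLGGNNLPAKFLE", 230),
    (275, "KPDLQRHWVPRPTPGPPQGA", 294),
    (77, "SFYSDWYQPACGKAGCQTYK", 96),
    (276, "PDLQRALGATPNTGPAPQGA", 299),
]


def span_mismatches(rows):
    """Return the rows whose residue count differs from end - start + 1."""
    return [(start, seq, end) for start, seq, end in rows
            if end - start + 1 != len(seq)]


for start, seq, end in span_mismatches(PEPTIDES):
    print(f"{start}-{end}: span {end - start + 1}, residues {len(seq)}")
```

The check only flags rows where the listed positions and the residue count disagree; it does not decide which of the two values is at fault.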



[Figures 9a to 9e: alignment of three nucleotide sequences showing probe regions A, B, C, D, E and F and primers P1 to P6]

Abstract

The invention concerns a method for producing a species specific tuberculosis diagnostic from 
an extract or a centrifuged precipitate of a killed culture of Mycobacterium strains, 
advantageously Mycobacterium bovis or Mycobacterium avium, being purified from lipoids by 
lipid solvents, characterized by separating the individual protein-polysaccharide complexes, 
obtained from the concentrated filtrate of the culture or from an extract obtained by acidic 
hydrolysis of the bacterium precipitate, by electrophoresis or by adjusting the pH to correspond 
to the isoelectric point of the individual fractions, and eliminating the protein from the fraction 
comprising the polysaccharide portion providing the desired species specific tuberculin or 
serological reaction by cleaving the protein-polysaccharide bond, then separating the pure 
species specific polysaccharide by precipitation or gel filtration and converting it into a 
diagnostic reagent. 



